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© Cloned genes encoding IG-CD4 fusion proteins and the use thereof. 

. ■ i* _| fha ,- M , nG1 or | Q G3 class, wherein the variable region of the light or 

© Fusion proteins of immunoglobuhns of *° or gG3 J ^ ^ jmmunog|obulin . 

heavy chain has been replaced with CD4 or fragments tnereoi cap « hain can be 

like Lecu.es comprising such ^^J^Z^ lZ^^^ « H ' V °' ™ 

Z^Z^^^Z^ ™ - ^ 9P12 ° ^ the 

molecule or fusion protein, and detecting whether a complex is formed. 
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CLONED GENES ENCODING IG-CD4 FUSION PROTEJNS AND THE USE THEREOF 

CROSS-REFERENCE TO RELATED APPLICATION 

^Jhis application is a continuation-in-part of U.S. Application Seria, No. 07/147.351 filed January 22. 

FIELD OF THE INVENTION 

The invention is in the field of recombinant genetics. 

BACKGROUND OF THE INVENTION 

? 6 ^ 9,yCOpr0tein ' 9P 120 * hich bi "<* «o the-CD4 pToleTn^sent on the surface of hele T 
lymphocytes, macrophages and other cells. Da.gleish et al. Nature. 3,2:763 (1984). After h > »i» binds tl 

E™ r'— ^ EE 

Zi h 6 m ° St VeXi " 9 3nd al3rmin9 3Spec,s 01 the in,ec *°"- augurs poorly for vacc nation 

efforts based upon conventional approaches. vaccination 

Rrs/ZmhrR^ P ' ay 3 r °' e in ineHiCaCV °' ,hS hum0ra ' res P° nse t0 immunodeficiency viruses 

few eS Pes 2£E £ "tSTS?. 0 ^T^" 1 * ,hemSe,V6S 3,8 heaV " y -o.ecu.es presenting 

lew epitopes suitable for h.gh affinity ant.body binding. The poorly antigenic, "moving" taroet which the viral 

"orcti e on PreSen,S - a " 0VVS ^ h ° St ' itt,e ° PP ° n ^ *° r ting 9 vifal Section' by CSlc an.Ibody 

Cells infected by the HIV virus express the 9P 120 glycoprotein on their surface. Gp,20 mediates fusion 

to the Z"? , 3 reaCti ° n Sim " ar ,0 ,hat by whiCh ,he «™ •"»'» *• -'n ectTd e Heading 

LTlnn 3 T on 6d mul,inucleated cells. Syncytium form|tion is dependent on a *2 

L^rTTTSm , n ): McDou9al - J S " 2! 2!: Sci ^ce . 231:382 (1986): Sodros"^. et al.. Nature 
~Th« rnf ^ t ' : - HF 25 <1986TSodro^r J. et a... Nature. 321:4,2 0986 ~ ' 

The CD4 protein cons.sts of a 370 ammo acid extrace.lu.ar region-c^tffi^fo^mmunoglobulin-Bke 

Other workers have shown that cel. lines, which were non-infective for HIV, were convened to mTecTabS 
line^fo-owing transfection and expression of the human CD4 cDNA gene. Maddon « J w££i 

In contrast to the majority of antibody-envelope interactions, the receptor-envelcpe interaction is 

IZT^TJVoT T y <K " = ,08Lm0,e, immU,ab ' e aSS0Cia,i °"- M — ^°^us fo 
MHC c. . TLZnl'] °' ma9m,Ude hi9her ,h3n ,he 3f1ini,y °' CD4 '° r i,S pu,a " ve endogenous ligand. the 
Inrn. h ^ '9ens. indeed, to date, a specific physical association between monomenc CD4 and Cass II 

antigens has not been demonstrated. 

th,. h nH?° nSe ? baC,erial ° f °' her PaniC ' e ' n,ec,ion - ,he host 0f 9 anism usua,, y Produces serum antibodies 
that bind to specific proteins or carbohydrates on the bacterial or panicle surface coating the bacteria Th" 

by an„bod y -dependent cellular tox.ci.y (ADCC). Other serum proteins, co.lectively ca „ed Complement ( C 
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bind to antibody-coated targets, and also can coat foreign particles nonspecilical.y. They cause cell death 
b lysis SLate ingestion by binding to specific receptors on the mac ' o P ha ^/^ d J 1 °^7 0 e 8 n i 
receptors. See Darnell J. et aL in Molecular Cell Biology . Scientific Amencan Books, pp. 641 and 108, 

(l98 ^he most effective complement activating classes of human Ig are IgM and lgG1. The complement 
svstlm consists of 14 proteins that, acting in.order. cause .ysis of ce.ls. Nearly all of the C prote.ns ex.st ,n 
nSS serum as inactive precursors, w'hen activated, some become highly specific pro.eo.yt.c enzymes 
whose substrate is the next protein in a sequential chain reaction. 

The en ire C sequence can be triggered by either of two initiation pathways. In one (the Oass.c 
pathway) Ab Ag complexes bind and activate C1. C4 and C2 to form a C3-splitting enzyme. In the second 
pa hZay polysaccharides commonly on the surface of many bacteria and fungi b.nd w.th trace amounts of 
Tc3 fragment and then w.th two. other proteins (factor B and properdin) to form another C3-sp.,tt.ng 
enzvme Once C3 is split by either pathway, the way is open for the remaining sequence of steps wh.ch 
Tad To cel. "sis See Davis B.D.. et aL .n Microbiology . 3rd ed.. Harper and Row. Ph.ladelph.a. PA. pp. 

^t^um^of workers have disclosed methods for preparing hybrid proteins. For example Murphy 
United States Paten, 4.675.382 (1987). discloses the use of recombinant DNA techniques to make .hybrid 
protein molecules by forming the desired fused gene coding for a hybrid prote.n of d.pthena tox.n and a 
noivoeDtide lioand such as a hormone, followed by expression of the fused gene. 

P *^wXE have prepared monoclona. antibodies (Mabs) by recombinant DNA techniques. Mon- 
oconal antTbodfes are highly specific well-characterized molecules in both primary and ternary structure. 
They ave been widely used for in vitro immunochemical characterization and quant.tat.on of ant-gens^ 
Genes for heavy and light chains hlve"bien introduced into appropriate hosts and expressed, followed by 
feagoreoation oi the individual chains into functional antibody molecules (see. for 

S ? 984); Morrison. S.L.. Science 229:1202 (1985): Oi et aL. Biotechniques 4:214 (1986): Wood et aL 
FiaTure 314 446-449 (1985)). LigW^nd-hTavy-chain variable regions have been cloned and expressed ,n 
££?„ ^oltTwherein they maintained their binding ability (Moore et aL European Patent Apphcation 
0088994 (published September 21. 1983)). 

Chimeric or hybrid antibodies have also been prepared by recomb.nant DNA techniques^ O. and 
Mornson. Biotechniques 4:214 (1986) describe a strategy for producing such ch.menc ant.bod.es wh.ch 
include a chimeric humanlgG anti-ieu3 antibody. 

G«co'gne NRJ et aL Proc. Natl. Acad. Sci. (USA) 84:2936-2940 (1987) disclose the preparation of a 
chime ^.rge'ne construcrco-talnlni ITc^»Tci^v»™. (V) domain and the constant C) reg.on 
codTng sequence of an .mmunog.obulin 7 2a molecule. Cells transfected w.th the ch.mer.c gene synth ,* a 
protein product that expresses immunoglobulin and T-cel. receptor antigenic determinants as we as prote n 
A Cdi^g sites. This protein associates with a norma. X chain to form an apparently norma, tetramenc 
/mi, W here H = heavy and L = light) immunoglobulin molecule that is secreted. 

' Sharon J et al.. Nature 309*4 (1984). disc.ose construction of a chimeric gene encod.ng the var^ble 
( V) reg^n of a fKooseT^ chain specific for the hapten azophenylfrsonate and the constant (C) rego * 
a mouse kappa light chain (V„C K ). This gene was introduced into a mouse myeloma cel. l.ne. The chimeric 
gene was expressed to give a protein which associated with light chains secreted from the mye.oma ce.. 
line to aive an antibody molecule specific for azophenylarsonate. 

Sor son. Science 229:1202 ,1985). discloses that variable light-or variable heavy-chain reg.ons can be 
attached o a T^ieqSence to create fusion proteins. This article states that the potential uses for the 
, S proteins are three (1) to attach antibody specifically to enzymes for use in assays; (2) to -so.ate non- 
la oroteins by antigen columns; and (3) to specifically deliver toxic agents. 

9 Recent techniques for the stable introduction of immunoglobulin genes into mye.oma cells (Baner,. J 
et a. CeH 33 729 740 (1983); Potter. H.. et a... Proc. Natl. Acad. Sri. (USA) 81:7161-7165 (1984) . coupled 
Stfdelled-structura. information, have p»mii53ThTuib of jn vitro DNA methods such as mutagenesis. 

5 to oenerate recombinant antibodies possessing novel properties. ,„, K «H iae 

PCT Application W087/02671 discloses methods for producing generally engineered ant.bod.es o 
desired variable region specificity and constant region properties through gene Coning and express.on of 
Soh and heavy chains. The mRNA from cloned hybridoma B cell lines which produce monoclonal 
a'nldies of des ed specificity is iso.ated for cDNA Coning. The generation of .ight and heavy cham 

s o fng quinces is accomp.ished by excising the Coned variab.e regions and .igating therr i to hgh t o 
heTy cha* module vectors. This gives cDNA sequences which code for immunoglobuhn cha.ns. The lack 
oHntrons Iws these cDNA sequences to be expressed in prokaryotic hosts, such as bactena. or .n lower 
eukaryotic hosts, such as yeast. 
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The generation of chimeric antibodies in which the antigen-binding portion of the immunoglobulin is 
fused to other moieties has been demonstrated. Examples of non-immunoglobulin genes fused to anti- 
bodies include Stanphylococcus aureus nuclease, the mouse oncogene c-myc. and the Klenow fragment of 
E. coli DNA polymerase I (Neuberger, M.S., et aL Nature 312:604-612 (1984): Neuberger. M.S.. Trends in 
5 Biochemical Science . 347-349 (1985)). European Patent Application 120,694 discloses the genetic engineer^ 
ing of the variable and constant regions of an immunoglobulin molecule that is expressed in E. coli host 
cells. It is further disclosed that the immunoglobulin molecule may be synthesized by a host ceii with 
another peptide moiety attached to one of the constant domains. Such peptide moieties are described as 
either cytotoxic or enzymatic. The application and the examples describe the use of a lambda-like chain 
70 derived from a monoclonal antibody which binds to 4-hydroxy-3-nitrophenyl (NP) haptens. 

European Patent Application 125,023 relates to the use of recombinant DNA techniques to produce 
immunoglobulin molecules that are chimeric or otherwise modified. One of the uses described for these 
immunoglobulin molecules is for whole-body diagnosis and treatment by injection of the antibodies directed 
to specific target tissues. The presence of the disease can be determined by attaching a suitable label to 
;s the antibodies, or the diseased tissue can be attacked by carrying a suitable drug with the antibodies. The 
application describes antibodies engineered to aid the specific delivery of an agent as "altered antibodies." 

PCT Application W083/101533 describes chimeric antibodies wherein the variable region of an im- 
munoglobulin molecule is linked to a portion of a second protein which may comprise the active portion of 
an enzyme. 

20 Boulianne et a!., Nature 312:643 (1984) constructed an immunoglobulin gene in which the DNA 
segments that encode mouse variable regions specific for the hapten trinitrophenol (TNP) are joined to 
segments that encode human mu and kappa regions. These chimeric genes were expressed to give 
functional TNP-binding chimeric IgM. 

Morrison et aL, P.N.A.S. (USA) 81:6851 (1984), disclose a chimeric molecule utilizing the heavy-chain 

25 variable region exons of an anti-phosphoryl choline myeloma protein G, which were joined to the exons of 
either human kappa light-chain gene. The. genes were transfected into mouse myeloma cell lines, 
generating transformed cells that produced chimeric mouse-human IgG with antigen-binding function. 

Despite the progress that has been achieved on determining the mechanism of HIV infection, a need 
continues to exist for methods of treating HIV viral infections. 

30 

SUMMARY OF THE INVENTION 



35 The invention relates to a gene comprising a DNA sequence which encodes a fusion protein comprising 
1) CD4. or a fragment thereof which binds to HIV gpl20, and 2) an immunoglobulin light or heavy chain; 
wherein said CD4 or HIV gpl20-binding fragment thereof replaces the variable region of the light or heavy 
immunoglobulin chain. - 

The invention also relates to vectors containing the gene of the invention and hosts transformed with the 

40 vectors. 

The invention also relates to a method of producing a fusion protein comprising CD4. or fragment 
thereof which binds to HIV gpl20, and an immunoglobulin light or heavy chain, wherein the variable region 
of the immunoglobulin light or heavy chain has been substituted with CD4. or HIV gpi20-binding fragment 
thereof, which comprises: 

45 cultivating in a nutrient medium under protein producing conditions, a host strain transformed with the 

vector containing the gene of the invention, said vector further comprising expression signals which are 

recognized by said host strain and direct expression of said fusion protein, and 

recovering the fusion protein so produced. 

The invention also relates to a fusion protein comprising CD4. or fragment thereof which is capable of 
so binding to HIV gpl20. fused at the C*terminus to a second protein which comprises an immunoglobulin light 

or heavy chain, wherein the variable region of said light or heavy chain is substituted with CD4 or a HIV 

gpi20 binding fragment thereof. 

• The invention also relates to an irnmunoglobulin-tike molecule comprising the fusion protein of the 
invention together with an immunoglobulin light or heavy-chain, wherein said immunoglobulin like molecule 
55 binds HIV gpl20. 

The IgGl fusion proteins and immunoglobulin-like molecules may be useful for both complement- 
mediated and cell-mediated (ADCC) immunity, while the IgM fusion proteins are useful principally through 
complement-mediated immunity. 
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The invention also relates to a complex between the fusion proteins and immunoglobulin-like molecule 

0< ^:7Z:^ZZl!s \o a method for treating H,V or S.V infections comprising admin.stenng the 
fusion orotein or immunoglobulin-like molecule of the invention to an an.mal. 

The invention further relates to a method for detecting HIV gp1 20 in a sample compns.ng conrang. 
sample suspected of containing HIV or gpl20 with the fusion protein or immunoglobul.rvl.ke molecule of the 
invention, and detecting whether a complex has formed. 



DESCRIPTION OF THE PREFERRED EMBODIMENTS 
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The invention is directed to a protein gene which comprises 

t) 7Sna seouence which codes for CD4, or fragment thereof which binds to H.V g P i20. fused to 
2) a DNA sequence which encodes an immunoglobulin heavy chain. 

Preferably, the antibody has effector function. 
The invention is also directed to a protein gene which comprises 

1) a DNA sequence which codes lor CD4. or fragment thereof which binds to HIV gpi20. fused to 
2 a DNA sequence which encodes an immunoglobulin light chain; wherein said sequence which 
codes for CD4. or HIV gpl20-binding fragment thereof, replaces the variable region of the light .m- 
munoglobulin chain. 

The invention is also directed to the expression of these novel fusion proteins in transformed hosts and 
the use thereof to treat and diagnose HIV infections. In particular, the invention relates to expressing said 
genes in mammalian hosts which express complementary light or heavy i ™ m ™^ » B ™ 

immunoglobulin-like molecules which have antibody effector function and also b.nd to HIV or S IV gpl20 
"e term "antibody effector function" as used herein denotes the ability to fix complement or to 

3 „ 3C %Te A »u2n proteins and immunoglobulin-like molecules may be administered to an animal for the 
purpose of treating H.V or S.V infections. By the terms "HIV infections" is intended the condition of having 
SdS. AIDS related complex (ARC) or where an animal harbors the A.DS v.rus. but does not exhibit he 
clinical symptoms of AIDS or ARC. By the terms "SIV infections" is intended the condition of being infected 
with simian immunodeficiency virus. * 

Bv the term "animal" is intended all animals which may derive benefit from the administration of the 
fusion proteins and immunoglobulin-like molecules of the invention. Foremost among such animals are 
humans however, the invention is not intended to be so limited. 

5 ihe term "fusion protein" is intended a fused protein comprising CD4. or fragment thereof which is 
caoable of binding to gpi20. linked at its C-terminus to an immunoglobulin chain wherein a portion of the N- 
erminus of the immunoglobulin is replaced with CD4. In general, that portion of immunoglobulin which .s 
de£ ted is the variable region. The fusion proteins of the invention may also comprise immunoglobulins 
where more than jus, ^variable region has been deleted and replaced with CD4 or HIV gpl ,20 binding 
fragment thereof. For example, the V M and CH1 regions of an immunoglobulin chain may be deleted. 
Preferably, any amount of the N-terminus of the immunoglobulin heavy chain can be deleted as long as the 
remaining fragment has antibody effector function. The minimum sequence required for binding . com- 
plement encompasses domains CH2 and CH3. Joining of Fc portions by the hinge region is advantageous 
for increasing, the efficiency of complement binding. 

The CD* portion of the fusion protein may comprise the complete CD4 sequence, the 370 ammo acd 
extracellular region and the membrane spanning domain, or the extracellular region. The fusion protein may 
comprL fragments of the extracellular region obtained by cutting the DNA sequence which encodes CD4 
at The Bspm' site a, position 514 or the Pvul. site at position 629 (see Tab.e 1) .0 9-^ 
sequences which encode CD4 fragments which retain binding to gpl20. In general, any fragment of CD4 
mav be used as long as it retains binding to gpl20. 

Where the fusion protein comprises an immunoglobulin light chain, i, is necessary .hat no more of he 
,a cZ be de.e.ed than is necessary to form a stable complex with a heavy chain lg. In pamcu lar the 
cystefne residues necessary for disu.fide bond formation must be preserved on both the heavy and light 

^ WheTexpressed in a host. e.g.. a mammalian cel.. the fusion protein may associate with other light or 
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heavy Ig chains secreted by the cell to give a functioning immunoglobulin-like molecule which is capable of 
binding to gpl20. The gpl20 may be in solution, expressed on the surface of infected cells, or may be 
present on the surface of the HIV virus itself. Alternatively, the fusion protein may be expressed in a 
mammalian cell which does not secrete other light or heavy Ig chains. When expressed under these 
5 conditions, the fusion protein may form a homodimer. 

Genomic or CDNA sequences may be used ir> the practice of the invention. Genomic sequences are 
expressed efficiently in myeloma cells, since they contain native promoter structures. 

The constant regions of the antibody cloned and used in the chimeric immunoglobulin-like molecule 
may be derived from any mammalian source. The constant regions may be complement binding or ADCC 

io active. However, preliminary work (see Examples) indicates that the fusion proteins of the invention may 
mediate HIV or SIV infected cell death by an ADCC or complement-independent mechanism. The constant 
regions may be derived from any appropriate isotype, including IgGl. lgG3, or IgM. 

The joining of various DNA fragments, is performed in accordance with conventional techniques, 
employing blunt-ended or staggered-ended termini for ligation, restriction enzyme digestion to provide 

75 appropriate termini, filling in of cohesive ends as appropriate, alkali and phosphatase treatment to avoid 
undesirable joining, and ligation with appropriate ligases. The genetic construct may optionally encode a 
leader sequence to allow efficient expression of the fusion protein. For example, the leader sequence 
utilized by Maddon et al., Cell 42:93-104 (1985) for the expression of CD4 may be used. 

For cDNA, the cDNA may be cloned and the resulting clone screened, for example, by use of a 

20 complementary probe or by assay for expressed CD4 using an antibody as disclosed by Dalgleish et al., 
Nature 312:763-766 (1984): Klatzmann et al.. Immunol. Today 7:291-297 (1986); McDougal et aT,~J. 
Immunol. 135:3151-3162 (1985); and McDougal. J. et al.. J. Immunol 137:2937-2944 (1986). ~ 
To express the fusion hybrid protein, transcriptional and translational signals recognized by an 
appropriate host element are necessary. Eukaryotic hosts which may be used include mammalian cells 

25 capable of culture in vitro , particularly leukocytes, more particularly myeloma cells or other transformed or 
oncogenic lymphocytes, e.g.. EBV-transformed cells. Alternatively, non-mammalian cells may be employed, 
such as bacteria, fungi, e.g., yeast, filamentous fungi, or the like. 

Preferred hosts for fusion protein production are mammalian cells, grown in vitro in tissue culture or in 
v ' v o in animals. Mammalian cells provide post translational modification to immunoglobulin protein mc£ 

30 ecules which provide for correct folding and glycosylation of appropriate sites. Mammalian cells which may 
be useful as hosts include cells of fibroblast origins such as VERO or CHO-K1 or cells of lymphoid origin, 
such as the hybridoma SP2/0-AG14 or the myeloma P3x63Sgh, and their derivatives. For the purpose of 
preparing an immunoglobulin-like molecule, a ptasmid containing a gene which encodes a heavy chain 
immunoglobulin, wherein the variable region has been replaced with CD4 or fragment thereof which binds to 

as gpl20, may be introduced, for example, into J558L myeloma cells, a mouse plasmacytoma expressing the 
lambda-1 light chain but which does not express a heavy chain (see Oi et al.. P.N.A.S. (USA) 80:825-829 
(1983)). Other preferred hosts include COS cells. BHK cells and hepatoma Tells. 

The constructs may be joined together to form a single DNA segrrent or may be maintained as 
separate segments, by themselves or in conjunction with vectors. 

40 Where the fusion protein is not glycosylated, any host may be used to express the protein which is 
compatible with replicon and control sequences in the expression plasmid. In general, vectors containing 
replicon and control sequences are derived from species compatible with a host cell are used in connection 
with the host. The vector ordinarily carries a replicon site, as well as specific genes which are capable of 
providing phenotypic selection in transformed cells. The expression of the fusion protein can also be placed 

45 under control with other regulatory sequences which may be homologous to the organism in its untransfor- 
med state. For example, lactose-dependent E. coli chromosomal DNA comprises a lactose or lac operon 
which mediates lactose utilization by elaborating the enzyme beta-galactosidase. The lac control elements 
may be obtained from bacterial phage lambda placS. which is infective for E. coli. The lac promoter- 
operator system can be induced by IPTG. 

so Other promoters/operator systems or portions thereof can be employed as well. For example, colicin 
El. galactose, alkaline phosphatase, tryptophan, xylose, tax. and the like can be used. 

For mammalian hosts, several possible vector systems are available for expression. One class of 
vectors utilize DNA elements which are derived from animal viruses such as bovine papilloma virus, 
polyoma virus, adenovirus, vaccinia virus, baculovirus. retroviruses RSV, MMTV or MOMLV), or SV40 virus. 

55 Cells which have stably integrated the DNA into their chromosomes may be selected by introducing one or 
more markers which allow selection of transfected host cells. The marker may provide for prototropy to an 
auxotrophic host, biocide resistance, e.g.. antibiotics, or heavy metals such as copper or the like. The 
selectable marker gene can be either directly linked to the DNA sequences to be expressed, or introduced 
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,o assembly to form the immunoglobulin-like molecule. immnr1ali2ed ce n s primarily myeloma or 

-rho hn«t cells (or immunoglobulin production may be .mmortalizea cens. primarily v 
, nhl a ceus The e ce^s may be grown in appropriate nutrient medium in culture ..asks or ,n,ec ed into 
lymphoma cells, inese cens may y w Kr . de m0use or hamster 

a synergistic host. e.g.. mouse or a rat or immunode ^ o ho* *te ud ^ 

subcutaneously and the chimenc antibody is harvested Mrom tne _d oo 

ecco-aanc. .it* convenwna. commons, such » p » $ be purilreo by 

cS. cons.am reoions ol im,™noglob u .ins are nugm. to- sample, by Bob,nson. H. el aL. PCT Appl.ca„on. 

«™ ai a n inM laGl or laG3 antibody which binds complement at the Fc region. 

may bind with complement, which mediates destruction ot tne ce.i. m 

i„»anl 1, S"oLy con?,, n a .apioiabe, » .herape* aoen, »h,ch enhances destrocpn o, the H,v 

,5 the in»enlK.n tor use ,n Hty-therapy are I I • ^m,,,™™ with neutron radiation. 

' ^21^*™^^^; - -r'to S. ,s,on protein 0, *T»S^Z 

M ^itZXic aciP (DTPA). T»r*a, examp.es o. metaiiic cations .h«* are bouno ,n ,h,s 

„. 99m T , i23i .3'| ? ? Ru. ' ? Cu. 6 7 Ga. and 08 Ga. 
manner are ie . . • 0 Qlobulin-nke molecule of the invention may be tagged w.th an 

,«e"^ 
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surface. The A chain actually enters the cytoplasm and inhibits protein synthesis by inactivating elongation 
factor 2. the factor that translocates ribosomes along mRNA concomitant with hydrolysis of ETP. See 
Darnell, J. f et at. in Molecular Cell Biology , Scientific American Books, Inc.. page 662 (1986). Alternatively, 
a fusion protein comprising ricin, a toxic lectin, may be prepared. 
5 Introduction of the chimeric molecules by gene therapy may also be contemplated, for exa— Die. using 
retroviruses or other means to introduce the genetic material encoding the fusion proteins into suitable 
target tissues. In this embodiment, the target tissues having the cloned genes of the invention may then 
produce the fusion protein in vivo. 

The dose ranges for the administration of the fusion protein or immunoglobulin-like molecule of the 
70 invention are those which are large enough to produce the desired effect whereby the symptoms of HIV or 
SIV infection are ameliorated. The dosage should not be so large as to cause adverse side effects, such as 
unwanted cross-reactions, anaphylactic reactions, and the like. Generally, the dosage will vary with the age. 
condition, sex and extent of disease in the patient, counterindications, if any, immune tolerance and other 
such variables, to be adjusted by the individual physician. Dosage can vary from .01 mg/kg to 50 mg/kg, 
/5 preferably 0.1 mg/kg to 1.0 mg/kg, of the. immunoglobulin-like molecule in one or more administrations 
daily, for one or several days. The immunoglobulin-like molecule can be administered parenterally by 
injection or by gradual perfusion over time. They can be administered intravenously, intraperitoneal^, 
intramuscularly, or subcutaneously. 

Preparations for parenteral administration include sterile or aqueous or non-aqueous solutions, suspen- 
se sions, and emulsions. Examples of non-aqueous solvents are propylene glycol, polyethylene glycol, 
vegetable oils such as olive oil. and injectable organic esters such as ethyl oleate. Aqueous carriers include 
water, alcoholic/aqueous solutions, emulsions or suspensions, including saline and buffered media. Paren- 
teral vehicles include sodium chloride solution. Ringer's dextrose, dextrose and sodium chloride, lactated 
Ringers, or fixed oils. Intravenous vehicles include fluid and nutrient replenishes, electrolyte replenishes. 
25 such as those based on Ringer's dextrose, and the like. Preservatives and other additives may also be 
present, such as. for example, antimicrobials, antioxidants, chelating agents, inert gases and the like. See. 
generally. Remington's Pharmaceutical Science . 16th Ed.. Mack Eds., 1980. 

The invention also relates to a method for preparing a medicament or pharmaceutical composition 
comprising the components of the invention, the medicament being used for therapy of HIV or SIV infection 
30 in animals. 

The detection and quantitation of antigenic substances and biological samples frequently utilized 
immunoassay techniques. These techniques are based upon the formation of the complex between the 
antigenic substance, e.g., gpl20, being assayed and an antibody or antibodies in which one or the other 
member of the complex may be detectably labeled. In the present invention, the immunoglobulin-like 

35 molecule or fusion protein may be labeled with any conventional label. 

Thus, the hybrid fusion protein or immunoglobulin-like molecule of the invention can also be used in 
assay for HIV or SIV viral infection in a biological sample by contacting a sample, derived from an animal 
suspected of having an HIV or SIV infection, with the fusion protein or immunoglobulin-like molecule of the 
invention, and detecting whether a complex with gpi20, either alone or on the surface of an HIV-infected 

40 cell, has formed. 

For example, a biological sample may be treated with nitrocellulose, or other solid support which is 
.capable of immobilizing cells, cell particles or soluble protein. The support may then be washed with 
suitable buffers followed by treatment with the fusion protein which may be detectably labeled. The solid 
phase support may then be washed with the buffer a second time to remove unbound fusion protein and 
45 the label on the fusion protein detected. 

In carrying out the assay of the present invention on a sample containing gpl20, the process 
comprises: 

a) contacting a sample suspected containing gpi20 with a solid support to effect immobilization of 
gpl20, or cell which expresses gpi20 on its surface; 
so b) contacting said solid support with the detectably labeled immunoglobulin-like molecule or fusion 

protein of the invention; 

c) incubating said detectably labeled immunoglobulin-like molecule with said support for a sufficient 
amount of time to allow the immunoglobulin-like molecule or fusion protein to bind, to the immobilized 
gpl20 or cell which expresses gpi20 on its surface: 
55 d) separating the solid phase support from the incubation mixture obtained in step c); and 

e) detecting the bound immunoglobulin-like molecule or fusion protein and thereby detecting and 
quantifying gpl20. 
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Alternatively labeled immunoglobulin-like molecule (or fusion protein) -gpt20 complex in a sample may 
be separated from a reaction mixture by contacting the complex with an immobilized antibody or prote.n 
wh,ch is specific for an immunoglobulin or. e.g. protein A, protein G. anti-IgM or anti-lgG antibod.es. Such 
antiimmunoglobulin antibodies may be monoclonal or polyclonal. The solid support may then be washed 
with suitable buffers to give an immobilized gpi20-<abeled immunoglobulin-like molecule ant.body complex. 
The label on the fusion protein may then.be detected to give a measure of endogenous gpi20 and. 

th6r Th^ Elates to a method for detecting HIV or SIV viral infection in a sample 

COmP, S> n contacting a sample suspected of containing gpl20 with a fusion protein or immunoglobulin-like 
molecule comprising CD4, or fragment thereof which binds to gpi20. and the Fc port.on of an ,m- 
munoglobulin chain. 

(b) detecting whether a complex is formed. 

The invention also relates to a method of detecting gpl20 in a sample, further comprising 

(c) contacting the mixture obtained in step (a) with an Fc binding molecule, such as an ant.body. 
protein A, or protein G. which is immobilized on a solid phase support and is specific for the hybrid fus.on 
protein to give a gpl20 fusion protein-immobilized antibody complex 

(d) washing the solid phase support obtained in step (c) to remove unbound fus.on prote.n. 

(e) and detecting the label on the hybrid fusion protein. 

Of course the specific concentrations of detectably labeled immunoglobulin-like molecule (or fusion 
protein) and gpl20. the temperature and time of incubation, as well as other assay cond.t.ons may be 
varied depending on various factors including the concentration of gpl20 in the sample, the nature of the 
sample, and the like. Those skilled in the art wild be able to determine operative and optimal assay 
conditions for each determination by employing routine experimentation. 

Other such steps as washing, stirring, shaking, filtering and the like may be added to the assays as is 
customary or necessary for the particular situation. . , _ . ■„ 

One of the ways in which the immunoglobulin-like molecule or fusion prote.n of the present invention 
can be detectably labeled is by linking the same to an enzyme. This enzyme, in turn, when later exposed to 
its substrate will react with the substrate in such a manner as to produce a chemical moiety which can be 
detected as 'for example, by spectrophotometric. fluorometric or by visual means. Enzymes which can be 
used to detectably label the immunoglobulin-like molecule or fus.on protein of the present invention mclude 
Z are not limited to. malate dehydrogenase, staphylococcal nuclease. delta-V-steroid isomerase. yeast 
alcohol dehydrogenase, alpha-glycerophosphate dehydrogenase, triose phosphate .somerase. horseradish 
Peroxidase, alkaline phosphatase, asparaginase, glucose oxidase, beta-galactosidase. r.bonuclease. urease, 
catalase. glucose-Vl-phosphate dehydrogenase, glucoamylase and acetylcholine esterase. 

The immunoglobulin-like molecule or fusion protein of the present invention may also be labeled with .a 
radioactive isotope which can be determined by such means as* the use of a flamma counter or a 
sedation counter or by autoradiography. Isotopes which are particularly useful for ..he purpose of the 
present invention are: '»■. ' 3 M. *P. 3S S. "C. »'Cr. *CI. "Co. "Co. -Fe and -Se. 

It is also possible to label the immunoglobulin-like molecule or fus.on prote.n with a fluorescent 
compound. When the fluorescently labeled immunoglobulin-like molecule is exposed to light of the proper 
wave length, its presence can then be detected due to the fluorescence of the dye. Among the most 
commonly used fluorescent labelling compounds are fluorescein isothiocyanate. rhodam.ne. phycoerythenn. 
phycocyanin. allophycocyanin. o-phthaldehyde and fluorescamine. ,, k »,„h 
The immunoglobulin-like molecule or fusion protein of the invention can also be detectably labeled 
usinq fluorescence emitting metals such as '»Eu. or others of the lanthanide series. These metals can be 
attached to the immunoglobulin-like molecule or fusion protein using such metal chelat.ng groups as 
diethylenetriaminepentaacetic acid (DTPA) or ethylenediaminetetraacet.c ac.d (EDTA) Hetertah . 

The immunoglobulin-r.ke molecule or fusion protein of the present .nvent.on also can be detectably 
labeled by coupling it to a chemiluminescent compound. The presence of the chemiluminescent-tagged 
immunoglobulin-like mo.ecu.e or fusion protein is then determined by detecting the presence .of um.nes- 
cTce that arises during the course of a chemical reaction. Examples of particularly oseh.1 chemilumines- 
cent labeling compounds are luminol. isoluminol. theromatic acridinium ester, imidazole, acr.dm.um salt and 

^^Ukewise' a bioluminescent compound may be used to label the immunoglobulin-like mo.ecu.e or fusion 
protein of the present invention. Bio.uminescence is a type of chemiluminescence found .n b.olog.cal 
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systems in which a catalytic protein increases the efficiency of the chemiluminescent reaction. The 
presence of a bioluminescent protein is determined by detecting the presence of luminescence. Important 
bioluminescent compounds for purposes of labeling are luciferin, luciferase and aequorin. 

Detection of the immunoglobulin-like molecule or fusion protein may be accomplished by a scintillation 

5 counter, for example, if the detectable label is a radioactive gamma emitter, or by a fluorometer, for 
example, if the label is a fluorescent material. In. the case of an enzyme label, the detection can be 
accomplished by colorimetric methods which employ a substrate for the enzyme. Detection may also be 
accomplished by visual comparison of the extent of enzymatic reaction of a substrate in comparison with 
similarly prepared standards. 

to The assay of the present invention is ideally suited for the preparation of a kit. Such a kit may comprise 
a carrier means being compartmentalized to receive in close confinement therewith one or more container 
means such as vials, tubes and the like, each of said container means comprising the separate elements of 
the immunoassay. For example, there may be a container means containing a solid phase support, and 
further container means containing the detectably labeled immunoglobulin-like molecule or fusion protein in 

is solution. Further container means may contain standard solutions comprising seriaJ dilutions of analytes 
such as gpt20 or fragments thereof to be detected. The standard solutions of these analytes may be used 
to prepare a standard curve with the concentration of gpl20 plotted on the abscissa and the detection 
signal on the ordinate. The results obtained from a sample containing gpl20 may be interpolated from such 
a plot to give the concentration of gpi20. 

20 The immunoglobulin-like molecule or fusion protein of the present invention can also be used as a stain 
for tissue sections. For example, a labeled immunoglobulin-like molecule comprising CD4 or fragment 
thereof which binds to gp120 may be contacted with a tissue section, e.g.. a brain biopsy specimen. This 
section may then be washed and the label detected. 

The following examples are illustrative, but not limiting the method and composition of the present 

25 invention. Other suitable modifications and adaptations which are obvious to this skill in the art are within 
the spirit and scope of this invention. 



EXAMPLES 

30 



Example 1 : Preparation of CD4-lg cDNA Constructs 

35 The extracellular portion of the CD4 molecule (See Madden, P.J., et al.. Ceil 42:93-104 (1985)) was 
fused at three locations in a human IgGl heavy chain constant region gene"by means of a synthetic splice 
donor linker molecule. To exploit the splice donor linker, a BamHI linker having the sequence 
CGCGGATCCGCG was first inserted at amino acid residue 395 of the CD4 Drecursor sequence (nucleotide 
residue 1295). A synthetic splice donor sequence * 

40 

GATCCCGAGGGTGAGTACTA 

GGCTCCCACTCATGAT7CGA 

45 

bounded by BamHI and Hindlll complementary ends was created and fused to the Hindlll site in the intron 
preceding the CH1 domain, to the Espl site in the intron preceding the hinge domain, and to the Banl site 
preceding the CH2 domain of the lgG1 genomic sequence. Assembly of the chimeric genes by ligation at 
the BamHI site afforded molecules in which either the variable (V) region, the V + CH1 regions, or the V. 
CH1 and hinge regions were replaced by CD4. In the last case, the chimeric molecule is expected to form a 
monomer structure, while in the former, a dimeric molecule is expected. 

On such genetic construct which contains the DNA sequence which encodes CD4 linked to human IgGl 
at the Hind3 site upstream of the CHI region (fusion protein CD4H7I) is depicted in Table 1. The plasmid 
containing this genetic construct (pCD4H 7 l) has been deposited in E. coli (MC1061/P3) at the American 
Type Culture Collection (ATCC) under the terms of the Budapest Treaty and given accession number 
" 67611. 

A second genetic construct which contains the DNA sequence which encodes CD4 linked to human 
IgGl at the Esp site upstream of the hinge region (fusion protein CD4E>1) is depicted in Table 2. The 
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plasmid containing this genetic construct < P CD4E,1) has been deposited in E_ coli <MCt06l,P3> at the 
ATCC under the terms of the Budapest Treaty and given accession number 67610. 

A third genetic construct which contains the DNA sequence which encodes CD4 UnKed «c .human M£ , at 
the Ms,2 site upstream of the CH, reg,on (fusion protem CWMu, » 

conti^ng this genetic construct (PCD4Mu) has been depos.ted .n E. coli <MC1061»P3) at the ATCC under 
the terms of the Budapest Treaty and given accession number 67608. 

A fourth genetic construct which contains the DNA sequence which encodes CD4 .in ed to human 
at the Pst site upstream of the CH2 region (fusion protein CD4Pu) is depicted .n Table 4. The piasmrd 
contain^ this genetic construct (PCD4Pu) has been deposited in E. coli (MC1061/P3) at the ATCC under 
the terms of the Budapest Treaty and given accession number 67609. 

A fTtth genetic construct which contains the DNA sequence which encodes CD4 linked to human IgGl at 
the Bam site downstream from the hinge region (fusion protein C04B T D is depicted in Table 5. 

Two similar constructs were prepared from the human IgM heavy chain constant region by fusion with 
the introns upstream of the u CM and CH2 domains at an MStll site and a PStl site respectively. The 
fusions were made by joining the PStl site of the CD4/I 9 G1 construct fused at the Esp site in igGt gene to 
the MStll and Pst sites in the IgM gene. In the first instance, this was perlormed by treatment of the Pst end 
with T4 DNA Polymerase and the MStll end with E_ coli DNA Polymerase, followed by ligation; and in the 

second instance, by ligation alone. r+r\ A 
immunoprecipitation of the fusion proteins with a panel of monoclonal antibodies directed against CD4 
epitopes showed that all of the epitopes were preserved. A specific high affinity association ,s demonstrated 
between the chimeric molecules and HIV envelope proteins expressed on the surface of cells transfected 
with an attenuated (reverse transcriptase deleted) proviral construct. 
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CCCTGTTTCACAACCACCCGCCAACAAACACCCAAGCCCACACGCCCTGCCATTTCTCTC 

1 .-- * 60 

CGCACAAACTCTTCGTCGCCCG77C7TTCTCCGTTCGCGTCTCCGGGACCGTAAAGACAC 
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GCCTCAGGTCCCTACTGCCTCAGGCCCCTCCCTCCCTCGCCAAGCCCACAATCAACCCCC 

61 ♦ 120 

CCCAGTCCAGGCATGACCCACTCCGCCCACGCACGGACCCGTTCCCCTCTTACTTCGCCC 

M N R G - 

H F F 

I B N HH NMD 

N B U HA UNO 

F V 4 AE 4 L E 

1 1 H 12 H 1 1 

GACTCCCTTTTAGGCACrrGCnCTCGTCCTCCAACICGCGCTCCTCCCAGCACCCACTC 

121 - 180 

CTCAGGGAAAATCCGTGAACGAAGACCACGACGTTGACCCCCAGGAGGGTCGTCGCTGAC 

VPFRHLLLVLQLALLPAATQ- 

B EE f R A 

B C C SI 
V 0 0 A U 

IKK 11 

ACGGAAACAAAGTGCTGCTGGGCAAAAAACGCGATACAGTCCAACTCACCTCTACACCTT 

181 240 

TCCCTTTCTTTCACCACCACCCCI 1 1 I I I CCCCTATGTCACCTTCACTCCACATCTCCAA 

CKKVVLCKKCDTVELTCTAS- 
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ANl K . A9 

422 1 * 6 
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ATCACCGCTCCTTCTTAACTAAACCTCCATCCAACCTCAATCATCGCGCTGAC1CAACAA ^ 

301 TACTCCCCACGA>GAA^GAT^CCA^ 

Q C S F L T K C P S K L N 0 R A D S R R - 

S W I A ID 

MANAS BA J F N D 



EKE S F L F E 

0AA9Y ^ 12 11 

22461 



GAAGCCTTIGGGACCAAGCAAACTTCCCCCTGATCATCAAGAATCTTA^ ^ 

3 " c^CGCAAACCCTGG^CCT^GAAGGCGC^ 

S L W D Q C N F P L I I K N L K 1 E 0 S- 

M M AM AM » 

B N VNUN | J 

0 L At 91 J 

2 1 

CAGATACTrACATaCTGMGTCGACGlcCACAAGGACGACGTCCAmGaACK^CG ^ 
421 iKrArGrArcrAcirASrArcrCOCCrCTlCaCCTCCACGrrAACGATCACAAGC 

D T Y I C E V E 0 Q K E E V Q L L V F C- 
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CTAAC7CACCGTTGACAC7C7CGC7CCACCAAG7CCCCG7C7CGGAC7CGGAC7CCAACC 

LTANSOTHLLQCQSLTLTLE- 
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AGAGCCCCCCTCCTAGTAGCCCCTCACTCCAATGTACCACTCCAAGGCGTAAAAACATAC 

541 « ♦ • • * • 600 

7CTCGGGGCGACCATCA7CGGGGAGTCACGTTACA7CCTCAGGTTCCCCATTTT7GTATG 



P 


S V 


Q C 


R S P 


R 


C K 


N 






N 


BBH S 


B 




BS 


w 


MD 


ASP 


A BSSGSC 


S 


B N 


SC 


B 


ND 


LPV 


L AP7IAR 


T 


A L 


TR 


0 


LE 


UBU 


U N1NACF 


X 


N A 


NF 


2 


11 


122 


1 221111 


1 


1 4 


11 






// 


/ /// 






/ 



AGGCGGCGAACACCC7C7CCCTGTC7CACCTGCACC7CCACCA7AC7CGCACCTGGACAT 

601 * ♦ 660 

7CCCCCCC77CTGGGACAGGCACAGAG7CGACCTCGAGGTCCTATCACCG7GGACC7CTA 

CCKTLSVSQLELQDSCTW7C- 

N 

NS M | NM A 

LP B HA L 

AH 0 EE U 

31 2 11 1 

CCACTG7CT7CCAGAACCACAACAACC7CCAGTTCAAAA7AGACA7CCTCGTGCTACC*rr 

661 * ♦ 720 

CC7GACACAACC7CT7GGTCT7CTTCCACC7CAAG7TTTA7C7GTAGCACCACCA7CCAA 

TVLQNQKKVEFKIOIVVLAF- 
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TCCACMCCCaCCACCATACTCTATAAGAAAGACCCCCAACACCTJCACTTCTCCTlCC ^ 
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CACTCGCCTTTACACTTGAMAGCTGACGGGCAGTGGCGAGCICTGGTCGC^ ^ 

781 ctcaccggaaatgtcaactt^c^ 

L a F T V E K L T C S C E I Q * * R- 
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H L ML 3 2 
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^KOO^ACCcic'M^aAACCKCACMCOCCAACAACCTCCCCCK^CACCKACCC^^ 

'"^rcrciAVcrcrcirc^roiooinArcwnmccAGCoccAcoTOCACTCcc 
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u cr HS D M H SCAHM 

N TR AT D N P TRUJN 

L NF EU E L H NF9EL 

1 1131 1 11 ^ 63 J 

TCCCCCACCCCTTCCaCAGTATGCTCCCTCTCCAAACCTCACCCTGGCCCnCAAGCCA 

1020 

961 ACCGCGTCCGGAACGCAGTCATA^ 

PQALPQYAGSCNLTLALEAK- 

S BS 

F SC H 0 A 

A TR P D L 

N NF HE U 

1 11 111 

AAACAGGAAAGTTGCATCAGCAAGTGAACCTCGTGGTCATGAGAGCCACTCAGCTCCACA 

# . . • ♦ 1080 

mGTCCmCAACCTAGTCCTTCACTTCCACCACCACTACTCTCCCTGAGTCCACGTCT 

TGKLHQEVNLVVMRATQLQK- 
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N 



PS S 
ADNNPA OF AM DE A 

VRLLUU DA IN OS L 



L AAAAM9 EN UL EP U 

1 224416 11 11 11 1 

AAAATTTGACCTCTCACGTGTGCGCACCCACCTCCCCTAAGCTGATGCTGAGCTTCAAAC 

1081 - 1140 

TTTTAAACTCGACACTCCACACCCCTCGGTCGAGCGCAnCGACTACCACTCCAACTTTG 

NLTCEVWGPTSPKLWLSLKL- 
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TCCAGAACAACCAGGCAAACGTCTCGAAGCGGGAGAACCCCCTGTGGGTGCTGAACCCTC 
ACCTCTTGTTCCTCCGTTTCCACAGCTrCGCCCTCTTCCCCCACACCCACGACTTCCCAC 
ENKE AKVSKREKPVWVLNPE 
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ACCCCCCCATCTCGCACTCTOCCTCAGTGACTCCGCACAGCTCCTCCTCCAATCCAACA 
1201 TCCCCCCCTACACCCTCACACACGACTCACTC^^ 
A C M W Q C 
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AGnCCAAGACCCCTGTACCACGTCCGCCCACCTGCCCCTAGCCCTCCCACTCATCATTC 
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P A TR AT T N NOP 
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1 3 11 31 1 1 111 

CTTTCTCCGCCACGCCACCCCTCACCTTGCCmCCCCCACCCAGCGCGCTAA^ ^ 

1321 CAAAGACCCCGTCCGGTCCGCACT^ 

B A BH B P 

BASHBHHNN P SG N BS F J 

AHPHBAPAL A PI \ J J- J 

NAWAEEHRA L 1A A , V 

121112114 1 21 3 22 1 



1381 



CACCTCCCCCCACUCGTCCACACCCAATCCCCATGAC ^ 

CTCCACCCCCCKCTCCACCTCKCC^ 

c BS S ' B SS B S ™ 

N M SC DNHA H SWAAHNABSAC NS 

UN TR RLAU H TNUUALPAPLR UP 

0 L NF AAE 9 A NL99EAAN1UF DB 

2i U 2436 1 11663412211 22 

TCCCGCACACnA>CAACCCACCCCCCTCTCCGCCTCCCCCCAGCTa ^ 
1441 AGCGCCTCTcUnCTlGCCTCCa 
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MS BNN NM S BMDMHNABSAA SCB 

AA ALL UN T BBRNALPAPUU TRA 

EC NAA 4L Y VOALEAAN199 NFN 

32 134 HI 1 12213412266 111 

/ i ii mi i 

GCTCACATCCCACCACCTCTCnCCACCCTCCACCAACCCCCCATCCCTCrrCCCCCTCC 

1501 * 1560 

CCAGTGTACCCTCCTCGACACAACCTCGCACCTCGTTCCCGGGTAGCCACAAGCCGCACC 

ASTKCPSVFPLA- 

BH B NFS BS F BS 
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L N NPI NPB PUUA TR U TR 

A L L1A L1V B49E NF 4 NF 

4 1 121 121 2H63 11 H 11 

/ / / 

CACCCTCCTCCAAGACCACCTCTGCCCCCACACCCGCCCTCCGCTCCCTCCTCAACCACT 

1561 1620 

GTCCCACCACGnCTCCTGGACACCCCCGTCTCCCCCCGACCCGACCCACCACTTCCTGA 

PSSKSTSGGTAALCCLVKOY- 















NF 


A 


BH 


H 


M 


T 


H 


0 


BANHBHN 


SN 


P 


SC 


P 


A 


T 


P 


D 


AHAHBAL 


PU 


A 


PI 


A 


E 


H 


H 


E 


NARAEEA 


B4 


L 


1A 


2 


3 


1 


1 


1 


1211124 
/ // 


2H 


1 


21 
/ 



ACTTCCCCGAACCGGTCACGCTGTCG7CGAACTCACCCGCCCTGACCACCCCCCTCCACA 

1621 1680 

TGAAGGCCCnCGCCACTCCCACAGCACCnCAGTCCCCGGCACTCGTCCCCCCACCTGT 

FPEPVTVSWNSG ALTSGVHT- 
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HNC DM I M ON M SM B 
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CCnCCCGCaGTCCTAGACTCCTCACGACTCTACTCCCTCACCAGCGTCCTCACCGTCC 

1681 1740 

GCAAGCCCCCACAGGATGTCACGACTCCTGACATGAGCGACTCCTCGCACCACTCCCACC 

FPAVLQSSCLYSLSSV VTVP- 

B F B B H 

SH N ASM B NSB MI 
PP U LTN A LPB A N 

1H 4 UXL N A1V E F 

21 H 111 1 421 2 1 

CCTCCACCAGCTTGCGCACCCAGACCTACA7CTCCAACGTGAATCACAACCCCACCAACA 

1741 \B00 

CCACCTCCTCCAACCCGTCCGTCTGCATCTACACCHCCACTTACTGT7CGCC7CCT7CT 
SSSLCTQTYICNVNHKPSNT- 
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CCCCCTCCACACAAACCCGCAGCTGCTGCGCTCACACCTCCCAACACCCATATCCGCCAC ^ 

2041 CCCGCACGTCTGT^CCCCCTCC^ 
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CACCTTCTCTCCTCCCAGAT7CCAGTAACTCCCAA1CT7CTCTCTGCAGACCCCAAATCT 
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E P K S 

N BBS BS 

M NS SSC SC HS M 

A LP PTR TR AT N 

E AH INF NF EU I 

3 31 211 U 31 1 

/ / / / 

TCTGACAAAACTCACACATCCCCACCCTCCCCACCTAAGCCACCCCAGGCCTCCCCCTCC 
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1 


2346 


1 


31 


211 


/ 


/ // 




/ 




/ 


/ 



CCCTCACCTCCCTACTCCCTCACGCCCCTGCCTCCCTCGCCAACCCCACAATCAACCCGG 

61 120 

CCGACTCCAGCCATCACCGAGTCCGCGGACCGAGCGACCCGT1CCCGTGTTACT7GCCCC 

M N R G - 

H F F 

I B N HH NMD 

N B U HA U N D 

F V 4 AE 4 L E 

1 1 H 12 H 1 1 

CACTCCCTTTTAGGCACnCCrrCTGGTGCTCCAACTCGCCCTCCTCCCAGCAGCCACTC 

121 ♦ 180 

CTCAGCGAAAATCCGTCAACGAAGACCACGACCT7CACCGCGAGGAGGGTCGTCGGTGAG 

VPFRHLLLVL QLALLPAATQ- 

B EE R A 

B C C I S L 

V D 0 A U 

IKK 11 

ACGCAAAGAAACTCCTGCTGGGCAAAAAAGCGCATACAGTCCAACTCACCTGTACAGCTT 

181 ♦ - * 240 

TCCCTTrCTTrCACCACCACCCGTTTTnCCCCTATGTCACCnCACTCCACATGTCCAA 

CKKVVLGKKGDTVELTCTAS- 
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H 

WW 1 

5 B f 
D 0 

2 2 1 



CCCACAACAACAGC ATACfcATlCCACTCCAAt.A ACTCC AACC AAACA^CTCCCAA ^ 

241 CC GT CTT CT7 CI C CT AT GTT A AGG1 C AC C TTTTT C AGCTI CGT CT ATTf CT AAC ACC CTT 

Q K K S I Q F H W K N S N Q I K 1 L G N - 

B S S F ! 

NBS F AA A A NH I 

LAP 0 VU L U U H J 

AN1 K A9 U 3 DA F 

422 1 26 

ATCAGGGCTCCTlCTTAACTAAAGCICCAlCCAAGC^CAATGAlCCCGCT^ ^ 

20 301 rAGrcrCGicrArGlinGlmcrACGrACGnCCACTTACTACCCCGACTCACnCTT 

Q G S F L T K G P S K L N D R A 0 S R R- 

S S H H 

MANAS BA I J ID 

BVLUT CU N F JD 

0AA9Y L3 j 2 11 

22461 1A 1 ' 

CAACCCTTTGGGACCAACGAAACTICCCCC^CATCATC AAGAA1CTT ^ 

FPL1IKNLKIEDS- 



361 



S L W D Q C N 

S 

4 A 



M M AM.AM M 



B N VNUN | 

0 L A191 J 

*o 2 1 2161 

cacatacttacatctctgaagtggacgIccacaagcaggacgtgcaattjctact^ ^ 

421 SSii^C^'c^ 



OTYICEVEDQK 



EEVQLLVFC 



50 



55 
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B 

S s 

P T 
M Y 

1 1 
CATrCACTCCCAAnCTCACACCCACCTCCnCAGCCGCACACCCTCACCCTCACCTTCC 

4 1 CTAAOCACGCnGAGAOGTCCC^GCACCAACTCCCC 

ltansdthllqgqsltltle- 

8 E5 H 

BS SC D I S 

AP TR D N N T 

Nl NF E L FY 

22 11 1 111 

agaccccccctcgtactagcccctcagtgcaatgtaccagtccaacggctaaaaacatac 

541 ♦ - * 600 

TCTCCCGCCCACCA7CATCGGCGAGTCACGT1ACATCCTCACGTTCCGCATTTTTCTATG 

SPPCSSPSVQCRSPRGKN1Q- 

N BBH S B BS 

M MO ASP A BSSGSC S B N SC 

B ND LPV L APTIAR T A L TR 

0 LE UBU U N1NACF X N A NF 

2 11 122 1 221111 1 1 4 U 

AGCGCCGGAACACCCTCTCCGTCTCTCAGaGCAGCTCCACGATACTCCCACCTCGACAT 
501 # ... ...... «.._--- . 660 

KCCCCCCT1CTCGGAGAGCCACACAGTCGACCTCGACCTCCTATCACCGTGGACCTCTA 

■CGKTLSVSQLE.LQDSCTWTC- 

NS " , J" * 

IP B f HA L 

AH 0 ^E U 

31 2 11 1 

GCACTGTCriGCAGAACCACAACAACGTGCAGTTCAAAATACACATCCTGCTGCTACCrr 

661 ♦ ♦ ♦ 720 

CCTGACAGAACGTCnGGTCnCTTCCACCTCAAGTTTTATCTGTACCACCACGATCCAA 

7VLQNQKKVEFK I0IVVLAF- 
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HS W M 

AT N N 

EU I L 

31 1 1 

TCCACAACCCCTCCACCATAGTCTATAACAAAGAGCGCCAACACCTCCACnCTCCTTCC 

721 780 

ACGKnCCCCAGCTCGTAlCACAUncmOCCCCCnCTCCACCTCAACACGAACG 

QKASSIVYKKECEQVEFSFP- 

A AW 

L L N 

U U L 

1 1 1 

CAOCGCCTnACACTTCAAAACCTCACCCCCACTCGCCACnGTCCTCGCACCCCGACA 

761 - - - - — • 840 

GTCAGCGGAAATGTCAACTTTTCGAC"rCCCCG T CACCCCTCCA:ACCACCCTCCCCCTCT 

LA FTVEKLTCSCEL**QAER- 
P S 

H V FV A V 
P N LN U B 
H L ML 3 0 
1 1 11 A 2 

GCGCncnCOCCAACTCnCCAKACCTTrCACCTCAACAACAACCAACTGTCTCTAA 
6*1 - 900 

CCCCAAGGAGCAGGTTCACAACCTACTCGAAACTGGACTfCTTCTTCCnCACAGACATT 

A5SSKSWITFDLKNKEVSVK- 



A H 
L P 
U H 
1 1 
I 

AACCCCnACCCACGACCCTAACCTCCAGATCCGCAAGAAGCTC<CCCTCCACCTCACCC 

901 * 960 

nGCCCAATCCGKnCCCAncCAGCKTACCCGnCT7CGACCCCGACCTCCACTCCG 

RVTQDPKLQ MCKKLPLHLTL- 

BS B5S 

M SC HS D M H SCAHM 

N TR AT 0 N P TRUAN 

L NF EU E L H NF9EL 

1 11 31 1 11 11631 

/ / / / 

TCCCCCAGCCCTTGCCTCACTATCCTCCCTCTCCAAACCTCACCCTCGCCCTTCAACCGA 

961 - ♦ 1020 

ACGCCGTCCGCAACGCACTCATACCACCCACACCTTTCCA<;TCCCACCGCGAACTr<GCT 

PQALPQYACSCNLTLALEAK- 



B 


BS PS 




SM 


SCADNFAD 


A 


TA 


TRVRlUUD 


L 


EE 


NFAAAW9E 


U 


23 


11224161 


1 


/ 


/ / // 
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S BS 

F SC HO A 

A TR P D L 

N NF HE U 

1 11 111 

AAACACCAAACTTCCATCACGAACTCAACCTGCTGCTCA7CACACCCACTCACCTCCACA 

1080 

TTTCTCCTTTCAACGTACTCCnCACTTGCACCACCACTAnCTCGGTGAGTCGAGCTCT 

TCKLHQEVNLVVMRA7QLQK- 

PS s 

M ADNN=A DF AM DE A 

N VRlLUU DA IN DS I 

L AAAAV? EN UL EP U 

1 22<*1£ il 1! 11 1 

///// / / / 

AAAATTTGACCTC7GAGGTCTCCGGACCCACCTCCCCTAAGCTCATCCTCAGCTTCAAAC 

1081 1140 

TTTrAAACTCGACACTCCACACCCCTCGGTGGAGGGGAnCCACTACGACTCCAACTTTC 

NLTCEVWCPT5PKLMLSLKL- 

M T H V. OW 

N A P N OS 

L Q A L ET 

1 1 2 1 12 

TGGAGAACAAGGAGGCAAAGGTCTCCAAGCGCCAGAACCCGGTGTGCCTGCTCAACCCTC 
1141 . 1200 

ACCTCTTCnCCTCCGTTrCCACACCnCGCCCTCnCCGCCACACCCACGACnCGCAC 

ENKEAKVSKREKPVWVLNPE- 

H PS ' H 

F D M I A ADPA I 

0 D A N V VRUU N 
K E E F A AAV9 F 

1 13 11 2216 1 

AGGCCGGCA7GTGCCACTCTCTCCTCAGTGACTCGCCACAGCTCCTGCTGCAATCCAACA 

1201 ♦ 1260 

TCCGCCCCTACACCCTCACACACGACTCACTGAGCCC7C7CCACGACCACCTTACGT7CT 

AGMWQCLL SDSGQVLLESNI- 
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1261 



S SA BHF BS H 

ANA HNCP SCNV.AANXA RSD I A 

VLU PCRA PIUNM'JLHV SCD N L 

AA9 A1FL 1ADLH3A0A AA£ D U 

236 . 2111 21211A421 111 3 1 

// II I I I I / „ 

KAACGTFC7GCCCACA7CG7CCACCCCCC7CCACCCCCA7CCCCACCG7CAC7ACTAAG 

----4 ----------- 1320 

AGnCCAAGACGGGTCTACCAGGTGGCCCCACGTGCCCCTAGGCCTCCCACTCATGATTC 



K 


V L 


P 


T W S 


T P V H 


A D P 


E 






E 




BS 


SS 


F 


BS 


F 


H 


CHH 


F 


SC 


HHNCF 


N 


BSC 


N 


P 


OHA 


0 


TR 


PCCRA 


U 


BTR 


U 


H 


4AE 


K 


NF 


AA]FN 


4 


VNF 


4 


1 


712 


1 


11 


21111 


H 


111 


H 


/ 






/ 


II 




// 





CT7CACCCC7CC7CCC7GCACCCA7CCCCGC7A7CCAGCCCCAC7CCACCCCAGCAACCC 

1321 * 1380 

CAACTCGCGACCACGCACCTGCCTAGCGCCCA'ACOTCCCCCTCACdCCCGKCT-TCCG 

S s 

DBhwmna m».'\;n VNDV 

RBA5PLU PNCR. N »0= 

AVE0KA9 AL1FA L L*=j 

2132146 2111' 1 1312 

// // // 
AGCCCCCCTCTGCC7CTTCACCCGCAGCCTCTCCCCGCCCCACTCATGCTCAGGCACAGC 

13E1 1 44 ° 

TCCCGGGCAGACGGACAAG7CCCCC1CCGAGACCGCCGGGCTGACTACGAG7CCCTCTCC 

BS P B BS S 

SC F V B K S SCDHA 

TR L A A L P TRRAU 

NF M E N A 1 NFAE9 

11 1 1 i 4 2 11236 

/ / / 

CTCTrCTCGCTTmCCCACCCrCTGGCCACGCACAGGCTAGGTCCCCCTAACCCACCCC 

1441 « * 1S00 

CACAAGACCGAAAAAGCCTCCGACACCCGKCC7G7CCGA7CCACGCCGAHCCC1CCCG 



B 


B 


B 




S 


PS 


S 


DBS 


S 


M 


HNC 


ADNPA 


P 


DAP 


P 


N 


PCR 


VRLUU 


M 


EN1 


w 


L 


AIF 


AAAM9 


1 


122 


1 


1 


211 


22416 




/ 






/ 


/ // 



CTCCACACAAACCGCCACCTCC7CCCC7CACACCTCCCAACACCCATA7CCCCCACCACC 

1S01 « 1S60 

CACC7C7G7T7CCCCC7CCACCACCCCAC7C7CGACCGnC7CCC7A7ACCCCC7CC7CC 
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10 



D H DAM 

0 A D L N 
E E E U I 

1 3 111 

CTCCCCCTCACCTAAGCCCACCCCAAACCCCAAACTCTCCACTCCCTCAGCTCCGACACC 

1561 4 ♦ 1620 

CACCCCCACTCGATTCGGGTGGGCTTTCCGGTT7CAGAGCTGACCCACTCCAGCCTCTGC 

LPLT-AHPKCQTLHSLSSDT - 

cp'-pkptpkaklstpsartp- 
apdlsppqrpnsplpqlg.nl- 

's i m mm df f 

N N AB DO A 

f L EO EK n 

1 1 32 11 i 

/ / 
30 TTCTCTCCTCCCACAnCCACTAACTCCCAATCTrCTCTCTCAGGGACTCCATCCCCCCC 

1621 * 1660 

AACAGAGGACGCTCTAACCTCATTGAGCCT7AGAACACAGACTCCaCACCTACCCCCGC 



C S A S A P - 



25 



C 

»•* C 
N 0 
L R 

30 1 1 

AACCCTTncCCCCTCGTCTCnCTGACAAncC 

1681 * 171/ 

■nCCCAAAACGGGCAGCAGACGACACICrrAACG 

as TLFPLVSCENS .... 
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Table 4 



F N 








S 


B 


N S 


6 


M 


H 


DHA 


S 


U P 


B 


N 


C 


RAU 


T 


4 B 


V 


L 


A 


AE9 


X 


H 2 


1 


1 


1 


236 


1 



/ 

GCC7C7TTCAGAACCACCCCCCAAGAAAGACCCAAGCCCACACCCCC7CCCA7TTC7C7C 

1 60 

CGCACAAACTCnCGTCCCCCCncmCTCCCnCCCCKTCCGGCACCCTAAACACAC 

B PS S S 

DBS ADNPA D DHNA V HW HNC 

DAP VRLUU 0 RALU N AN PCR 

EN1 AAAV9 E AEA9 L EL AIF 

122 22416 1 2346 1 31 211 

/ / // / / / 

CGCTCACCTCCCTACTCGCTCACGCCCCTGCC7CCCTCCCCAACGCCACAA7CAACCCCC 

61 320 

CCCAGTCCAGCGATGACCCACTCCCCCGACCCACCCACCCGmCCCTCTTACncCCCC 

M N R C - 

H F F 

I B N HH NMD 

N B U HA UN D 

F V 4 AE 4 L E 

1 1 H 12 H 1 1 

GACTCCCTTT7ACCCACnGC"nCTGGTCCTGCAACTGGCCCTCCTCCCACCAGCCACTC 

121 • 180 

CTCACCGAAAA7CCGTGAACGAACACCACGACGTTCACCCCGAGCACCCTCG1CCGTGAC 

VPFRHL LLVLQLA|LLPAATQ- 

B E E R * 

B C C S L 

V 0 0 A U 

IKK 11 

AGCCAAACAAAG7CG1CC7CCCCAAAAAACCCGATACACTCGAAC7CACCTCTACAGCTT 
181 - 240 

7CCCTTICTT7CACCACGACCCGI I i I I ICCCC7A7GTCACCT1GACTCCACATC7CGAA 

GKKVVLCKKGDTVEL7C7A5- 
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H 

M M 1 

B B N 
0 0 F 
2 2. i 
CCCAGAAGAACACCA7ACAArrCCACTGGAAAAAC7CCAACCA:ATAAA'jA'nCTGGCii 

300 

CCGTCnCTTCTCGTATCnAAGCTCACCTTTTrCAGCrrGGTCTATTTCTAACACCCn 

QKKSIQFHWKNSNQIKILCN- 



K S 


I Q 


F H W K 


N 5 


N Q 


I 


K 


I 


B 




S 




S 


F 




H 


NBS 


F 


AA 


A 


A 


N 


H 


I 


LAP 


0 


VU 


L 


U 


U 


H 


N 


AN1 


K 


A9 


U 


3 


D 


A 


F 


422 


1 


26 


1 


A 


2 


1 


1 


/ 




/ 













ATCACGCCTCCnCT7AACTAAACCTCCATCCAACCTCAATGATCCCCCTGACTCAAGAA 

301 360 

TAGTCCCGAGCAAGAAnCATTrCCAGGTACCTTCCACnACTACCCCCACTGAGTTCn 

QCSFLTKCPSKLNDRADSRR- 

S S H H 
MANAS BA I A ID 
BVLUT CU N F N D 
0AA9Y L3 F L F E 
22461 1A 12 11 
/ / 
CAAGCCrnCCCACCAACCAAACrrCCCCCTGATCATCAACAATCTTAACATAGAAGACT 
361 » - . 420 

CTTCGGAAACCCTCGnCCTTTCAAGGGGCACTACTAGnCTTAGAATTCTATCnCTGA 

SLW0QCNFPL1IKNLKIE0S- 

S 

M M AW AM v 

B N VNUN I A 

0 L AL9L E 

2 1 2161 l 

// 

CAGATACrrACATCTCTCAACTCCACGACCACAACCAGCACCTGCAArrcCTACTCncC 
<21 480 

C7CTATCAATG7AGACACnCAt:C7CCTCCTCrrcC7CCTCCACGrrAACCATCACAACC 
OTYICEVEDQKEEVQLLVFC- 
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B 

S s 

P T 

M Y 

1 1 
CATrCACTCCCAAaCTGACACCCACCTGCnCACCCCCAGACCCTCACCCTCACCTlCC 

481 S£C 

CTAACTGACCGTTGAGACTCTGGCTCCACCAACTCCCCGTCTCGCAC1CCGACTGCAACC 

LTANSDTHLLQCQSLTLlLE- 

B BS H 

BS SC D VIS 

AP TR D N NT 

Nl NF E L FY 

22 11 1 111 

ACAGCCCCCaCGTACTAGCCCCTCACTCCAATCTAGGAGTCCAAGCCCTAAAAACATAC 

600 

541 TCTCCCCCCCACCAKATCCCCCACT 

S PPC5SPSVQCR5PRCKN1U- 

N BBH S B BS 

M MD ASP A BSSCSC S B N SC 

B ND LPV I APT1AR T A L TR 

0 IE UBU U N1NACF X N A NF 

2 11 122 1 221111 1 1 * 11 

ACCCCCCCAACACCCTCKCClCTCTCACCTCCACC-rCCACCATACTCCCACCTCCACAT 



660 

601 K«CCCCnCTCCCACAC«^ 



V C K T L S V S Q L E L Q D S C T W T C 

y NV A 



N 



661 



N5 b HA L 

AS 0 I EE U 

CCACTC7CnCCAGAACCACAACAACGlCCAGT7CAAAATAGACAKCTCGTCCTA^ ^ 

TVLQNQKKVEFKlDIVVL*r 

HS MM 
AT N N 

EU L L 

31 1 1 

TCCACAAGGCCTCCACCATAGTCTATAACAAACACWCGA^ ^ 
721 AGCTCT1CCGCAGGTCGT ATC^ 



Q * 



aSS3 vykkeceqvefsfp 
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A AM 

L L N 

U U L 

1 1 1 

CACTCCCCTTTACACrrCAAAAGCTCACGCCCAGTCCCGACCTGTCCTCCCACCCGCAGA 

761 SO 

CTCAGCGCAAATCTCAACrmCCAaGCCCGTCACCGCTCGACACCACCCTCCCCCTCl 
LAFTVEKLTCSCEL»»QAER- 

P S 

H U FM A V 
P NLNU 6 
H L ML 3 0 
1 1 11 A 2 
GGGCTTCCTCCTCCAAGTCrrCGATCACCTTTCACCTCAAGAACAACGAACTCTCTGTAA 

841 ♦ . -.♦ « • • 900 

CCCGAACCACCACGT7CACAACCTAGTCCAAACTGGACT7CTTGTTCCTTCACAGACATT 
ASSSKSWITFDLKNKEVSVK- 



A H 

L P 

U H 

1 1 



901 ♦ 960 

TTCCCCAATCGGTCCTCCGAnCGACGTCTACCCCnCTrCCAGCCCGAGGTGGACTGCG 
RVTQDPKLQMCKKLPLHL TL- 

BSS 
SCAHV 
TRUAN 
NF9EL 
1L631 



B 


BS PS 




SM 


SCADNPAD 


A 


TA 


TRVRLUUO 


L 


EE 


NFAAAM9E 


U 


23 


11224161 


1 


/ 


/ / // 





BS 






M SC HS 


D 


M H 


N TR AT 


0 


N P 


L NF EU 


E 


L H 


1 11 31 


1 


1 1 


/ / 







IL631 
/ / 



961 « --- — -* 1020 

ACCGGCTCCGGAACGCACTCATACCACCCACACCTTTGGACTGGCACCGGGAACnCGCT 

PQALPQYACSCNLTLALEAK- 

S BS 

F SC H D A 

A TR P D L 

N NF HE U 

1 11 111 

/ 

AAACACGAAACTTCCATCACCAACTCAACCTCCTCCTGATCAGACCCACTCACCTCCACA 

1021 10S0 

TnGTCCTTrCAACCTAGTCCTTCACTrCGACCACCACTACTCTCGGTCACTCCAGCTCT 

TGKLHQEVNLVVMRATQLQK- 
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PS 


S 








M 


ADNNPA 


OF 


AW 


DE 


A 


N 


VRLLUU 


DA 


LN 


CS 


L 


L 


AAAAM9 


EN 


UL 


EP 


U 


1 


22*416 


11 


11 


11 


1 




///// 


/ 


/ 


/ 





AAAATTTCACCTCTCACCTCTCCCCACCCACCTCCCCTAACCTCA7CCTCACCTTCAAAC 

1081 ---- 114C 

TTTTAAACTCCACACTCCACACCCCTCGCTCCAGGCGATrCCACTACCACTCCAACrnC 
NLTCEVWCPTSPKLMLSLKL- 

M T H M 01/ 

N A P N 05 

L Q A L ET 

1 1 2 1 12 

TGGACAACAACCAGCCAAACGTCTCGAACCCCCACAACCCCC7CTCCCTCCTCAACCCTC 
1H1 * 1200 

ACCTcrrcTTcncccTTrccACAccrrcccccTcncccccACAcccACGAcncccAC 

ENKEAKVSKREKPVWVLNPE- 



V 


s 


K 


R 


E K 


P V W 


V I 








H 




PS 


H 


F 


D 


M 


I 


A 


ADPA 


I 


0 


D 


A 


N 


V 


VRUU 


N 


K 


E 


E 


F 


A 


AAV9 


F 


1 


1 


3 


1 


1 


2216 


1 



/// 

AGGCGCCGATGTCGCAG1CTCTGCTCAGTCAC1CCCCACACCTCCTCCTGCAATCCAACA 

1201 - 1260 

TCCGCCCC7ACACCCTCACACACCACTCAC7CACCCC7CTCCACCACGACCT7AGGT7GT 
ACMWQCLLSDSCQVLLESNI- 

BHF BS H 

SCNVAANXA RSD I A 

PIUNVULHV SCO N I 

1A0LK3AOA AAE D U 

21211A421 111 3 1 

/ / / / / 
:ACGCCCA7CCCCAGCC7GAG7AC7AAG 

1261 * 1320 

AGT7CCAAGACCGC7G7ACCAGC7GGCCCCACG7GCGCC7ACGCC7CCCAC7CA7CAnC 



s 


SA 


ANA 


HNCP 


VLU 


PCRA 


AA9 


A1FL 


236 


2111 


// 


// 



/ L 


P 


7 W 


S T P V H 


A 0 P 


E 




E 




8S 


ss 


F 


BS 


F 


CHH 


F 


SC 


HHNCF 


N 


BSC 


N 


OHA 


0 


7R 


PCCRA 


U 


B7R 


U 


4AE 


K 


NF 


AA1FN 


4 


VNF 


4 


712 


1 


11 


21111 


H 


111 


H 






/ 


II 




// 





H 
P 
H 
1 

CT7CAGCGC7CC7CCC7CGACGCA7CCCCCC7A7GCAGCCCCAC7CCACCGCACCAACGC 



1321 - 13? 



CAAG7CGCCAGCACCCACC7CCG7ACCCCCGA7ACCTCCCCC7CAGC7CCCC7CGT7CCG 



45 



EP 0 325 262 A2 



1381 



1441 



S s 

D6HIHNA HMNCN . M WD'.' 

RB*.3PLU pncrl N N.Ds 

AVE0HA9 AL1FA I l*EC 

2132146 21114 1 1312 

AGGCCCCGTCTGCCTCTrCACCCCGAGCCTCTGCCCCCCCCACTCATGCTCAGGGAGACG 

TCCCGGCCAGACGGAGAAGTCCCCCTCGGAGACGGGCCGGGTGAGTACCAGTCCCTCTCC 

BS P B BS S 

SC F M B N S SCDHA 

TR L A A L P TRRAU 

NF M E N A 1 NFAE9 

31 1 114 2 11236 

/ / / 

GTCTTCTCGCTTTTrCCCAGCCTCTGCCCACGCACACCCTACCTCCCCCTAACCCAGCCC 

CACAAGACCGAAAAAGCGTCCGAGACCCCTCCCTCTCCCATCCACGCCCATTCGGTCCGC 



1501 



1500 



B 


B 


B 




S 


PS 


S 


DBS 


S 


M 


HNC 


AONPA 


P 


DAP 


P 


N 


PCR 


VRLUU 


M 


EN1 


M 


L 


AIF 


AAAV9 


1 


122 


1 


1 


211 


22416 


/ 






/ 


/ // 



1561 



CTGCACACAAAGGCGCACGTGCTGGCCTCAGACCTGCCAAGAGCCATATCCGGCAGCACC 

GACGTGTGTTTCCCCGTCCACGACCCCAGTCTGGACGGTTCTCGGTATACGCCCTCCTGG 

D H DAM 

D A D L N 

E E E U L 

1 3 111 

CTGCCCCTGACCTAAGCCCACCCCAAAGGCCAAACTCTCCACTClClCAGCTCCGACACC 

GACGGGGACTGGATTCGCCTCGGGTTTCCGCTTTCACACGTCACGCAGTCCAGCCTCTGC 



156C 



1620 



H F 

I M MM BP DE AN 

N N AB BS DS LU 

F L EO VT EP U4 

1 1 32 11 11 1H 

/ / / 

TTCTCTCCTCCCACAnCCAGTAACTCCCAAlCTlCTCTCTCCAGTGA'nGCTGACCTCC 

1621 - 

AAGACAGGACGCTCTAACGTCArrGACCGnAGAAGACACACCTCACTAACGACTCCACG 

V I A E L P - 
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1681 



F 

V H V V. N 

B C N B U 

OA L 0 D 

occcIaacicaccctcttcctcccaccccgccacggcticttcgccaacccccccaact ^ 

GAGGG^CACTCCCACAACCAGCC^ 

PKVSVFVPPRDCFFCNPRKS. 

BS S H BSF 

. II H HNC I B SMC N 

L PCR N B TNRU 

U NF E AIF F V NLF 4 

1 11 3 211 1 1 HI H 

/ // II 

CCAACCTCATCTCCCACCCCACCCCTTTCACTCCCCCCCACATTCACCTCTCCTCW ^ 

1741 MmCACTACACCCTCCcilC«^ 



L 1 C Q A T C F S P R Q 1 Q V S W I * 

3 

NH S H H AW 



S BS H 
L 5 u H AW AA SCW D H I 



UH P PC HA VU TRN D A K 

DA U HA AE A9 NF L £ E 

21 1 1 1 23 26 111 1 3 1 

CCCACGCCAACCAGCTGGCGTCTGCCGTCACCACCGACCACGTCCACCCTGACGa ^ 

1801 icaccccmciccAccc^ 

EGKQVCSCVTTDQVQAEAKE- 
SS B B 

AAHNABS SM o | 

UUALPAP TA P 

99EAAN1 EE H 

6634122 23 1 

ACTCTCCCCCCACCACCTACAACC7GACCACCACACTCACCATCAAACAC ... ^ 
18 " TCACACCCCCCTCCTCCATC^CCK • • 



SCPTTYKV 



T S T L T 1 K E 
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Table 5 

FN SB 

NS B M H DHA s 

UP B N C RAU T 

* B V L A AE9 X 

H 2 1 1 1 236 i 

CCC7GTT7CACAACCAGCCGCCAACAAACACCCAAGCCCACACCCCC7CCCATT7C7C7C 

1 60 

CGCACAAACTCnCGTCGCCCGTTCTTTCTCCCnCGGGTCTCCGCCACCCTAAAGACAC 

B PS S s 

DBS ADNPA D DHNA M HM HNC 

DAP VRLUU D RALU N AN PCR 

EN1 AAA.M9 E AEA9 I EL A1F 

122 22416 1 2346 1 31 211 

/ / // / / / 

GCCTCAGGTCCCTACTCCCTCACCCCCCTCCCTCCCTCCGCAACCCCACAATGAACCGGC 

61 " - ♦ 120 

CCCACTCCACCCATCACCCACTCCCCCCACCCACCGACCCCnCCGCTGTTACTTCCCCC 

M N R G - 

H F F 

I B N HH NMD 

N B U HA U N D 

F V 4 AE 4 L E 

1 1 H 12 H 1 1 

GAGTCCCTTTTAGCCACT7GCT7C7GGTCCTGCAACTGCCGCTCCTCCCAGCAGCCACTC 

121 180 

CTCACCGAAAATCCCTCAACCAACACCACGACGrrCACCCCCACCACCCTCCTCGCTCAC 

VPFRHLLLVLQLALLPAATQ- 

B E E R A 

B C C 1 S L 

V 0 0 A U 

IKK 1 ! 

AGGCAAACAAACTGGTCCTGGCCAAAAAACCGGATACAGTGGAACTCACCTCTACAGCTT 

"1 240 

TCCCTT7CTTTCACCACCACCCCTTTTTTCCCCTA7GTCACCTTCACTCCACA7G7CCAA 

CKKVVLGKKCDTVELTCTAS- 

H 



I 



B B n 

0 0 F 
2 2 ! 

CCCACAACAACACCA7ACAAT7CCACTCCAAAAACTCCAACCACATAAACAT7CTGCCAA 
241 3C - 

GGG7CnCT7C7CC7ATCT7AAGC7CACC7TmCACGnGC7C7ATTTC7AAGACCCT7 
GK^SIQFHWKNSNQIKILGN- 
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NSS F 
LAP 0 

an: k 

422 1 

atcagcgctccttcttaactaaagctccatccaacctcaatcatccccctcactcaacaa 

301 ♦ ♦ ♦ ♦ ♦ 36C 

7AGKCCCACCAACAAT7CATT7CCAGGTAGCT7CCACT7ACTACCGCGACTCACT7CTT 

QGSFLTKGPSKLNORADSRR- 

S S H H 

MANAS BA I A 10 

BVLUT CU N F NO 

0AA9Y L3 F L F E 

22461 1A 12 11 

/ / 
CAACCCTTTCCCACCAACGAAACTTCCCCCTCATCATCAACAATCTTAACATACAACACT 

361 4 20 

CnCGCAAACCCTCCnCCTTTCAAGGCCCACTAGTACrrCTTAGAAnCTATCTrCTCA 

SL WDQGNFPL I I KNLKIEDS- 

S 

M M AVAV M 

B N VNUN A 

0 L AL9L E 

2 1 2161 1 

// 

CAGATACTlACATCTCTCAACTCCAGCACCAGAACCAGCACGTGCAAnCCTACTGTTCG 

421 ... . 480 

CTCTATGAATCTACACACnCACCTCaCCTCnaTCCTCCACCTTAACGATCACAACC 

DTYICEVEOQKEE^QLLVFC- 

B 

s s 

P T 
M Y 
1 1 

CA-nCAC7CCCAACTCTCACACCCACC7CCT7CACCCCCACACCCTCACCCTCACCTTCG 

481 SAZ 

C7AAC1GACGCT7CAGAC7C7CCG7GCACGAAG7CCCCG1C7CGGACTCCCAC7GGAACC 

LTANSDTHLLQGQSLTLTLE- 
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ACACCCCCCCTGGTACTACCCCCTCAC7GCAATCTACCAGTCCAAGCGCTAAAAACATAC 

5*1 ♦ 600 

TCTCGGCCCCACCAKATCCCCCAGTCACGnACA7CC7CACGmCCCATTTT7CTATC 
SPPCSSPSVQCRSPRGKNI Q - 
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AGGGGCCCAACACCCTCTCCGTCTCTCACC7CGAGCTCCAGGATACTGCCACCTCCACAT 

601 * 660 

TCCCCCCCnCTCCCACACGCACAGACTCCACCTCGACCTCCTATCACCCTGCACCTCTA 

GCKTLSVSQLELQDSCTWTC- 

N 

NS M NV A 

LP B HA L 

AH 0 EE U 

31 2 11 1 

/ 

CCACTGTCnCCACAACCACAAGAAGCTCCACTTCAAAATACACATCCTCCTCCTACCTT 

661 - ♦ 720 

CGTCACAGAACGTCnGClCnCTlCCACCTCAAGTmATCTCTACCACCACGATCCAA 
TVLQNQKKVEFKIDIVVLAF- 

HS MM 

AT N N 

EU L L 

31 1 1 

/ I 

TCCAGAAGCCCTCCAGCATAGTCTATAAGAAACAGGCGCAACACGTCCACnCTCCTTCC 

721 • * 780 

AGGTCTTCCGGACGTCGTATCAGATATTCTTTCTCCCCCTTCTCCACCTCAACAGGAAGC 
QKASSIVYKKEGEQVEFSFP- 

A AM 

L L N 

U U L 

1 1 1 

CACTCCCCmACACrrCAAAAGCTGACCCCCAGTGCCCACCTCTCCTCCCAGGCCCACA 

781 

CTGAGCCGAAATGTCAACTTTrCCACTCCCCGTCACCGCTCCACACCACCCTCCCCCTCT 

LAFTVEKLTCSGELW ( WQAER- 
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P S 

H V FW A M 
F N IN U B 
H L ML 3 0 
] 1 11 A 2 
CCGCTTCCTCCTCCAACUnCCATCACCTTTCACCTCAAGAACAACCAACTCTCTCTAA 



841 - 9C 



CCCCAAGCACGAGGTTCACAACCTACTCCAAACTCCACnCTlCnCCrrCACACACAn 
ASSSKSWITFOLKNKEVSVK- 

8 BS PS 

SM SCADNPAD A AH 
TA TRVRLUUD I LP 
EE NFAAAM9E U U H 

23 1122^161 1 1 1 

* / / // 

AACCCGTTACCCACCACCCTAACCTCCACATCCGCAACAAGCTCCCCCTCCACCKACCC 

901 • -» * « * 960 

TTCCCCAATCCGTCCTCCGATrCCAGCTCTACCCGncnCCAGGGCGAGCTCCACTCCC 

RVTQDPKLQMCKKLPLHLTL- 

es bss 

M 5C HS D M H SCAHM 

N TR AT D N P TRUAN 

L NF EU E L H NFSEL 

1 11 31 1 11 H631 

/ / / / 

TGCCCCACCCCnCCCTCAGTATCCTCCCTCTCCAAACCTCACCCTCCCCCTTGAACCGA 

961 1026 

ACCCCCKCCCAACCCACKATACGACCCACACCTTrCCAGTCCCACCCCCAACnCCCT 

PQALPQYACSCNLTLALEAK- 

S BS 

F SC H D A 

A TR | P D L 

N NF H E U 

1 11 111 

/ 

AAACACCAAACrTCCATCACCAACTCAACCTCGTCCTCATGAGACCCACTCACCTCCACA 

1021 * • * 1080 

TTIGTCCTTTCAACCTACTCCTTCACrrCCACCACCACTACTCTCCCTCAGTCCACCTCT 

TCKLHQEVNLVVMRATQLQK- 
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PS S 

M ADNNPA DF AM. DE A 

N VRLLUU DA LN DS L 

L AAAAM9 EN UL EP U 

I 224416- 11 11 11 1 

///// / / / 

AAAATT7CACCTGTGAGGTCTGCGCACCCACCKCCCTAAGCTCATGCTCACCTTGAAAC 

1081 • 1140 

nTrAAACTGCACACTCCACACCCCTGGGTGCACCGCAnCGACTACCACTCGAACTTTG 

NLTCEVWCPTSPKLMLSLKL- 

w T H M DM 

N A P N DS 

I Q A L ET 

I 1 2 1 12 

TGCACAACAACGAGGCAAACGTCTCGAAGCCGCAGAACCCCCTGTCGGTCCTCAACCCTG 

1141 - — - 1200 

ACacrrGTTCCTCCGTnCCACAGCTTCGCCaCTTCCGCCACACCCACGACTTGGGAC 

ENKEAKVSKREKPVWVLNPE- 

H PS H 

F D M I A ADPA I 

0 D A N V VRIAJ N 
K E E F A AAV9 F 

1 13 11 2216 1 

/// 

ACGCCCCGATGTCCCAG1GTCTCC7GAGTGAC7CGCCACACC7CCTCCTGCAATCCAACA 

1201 - 5260 

TCCCCCCCTACACCGKACAGACCACTCACTCAGCCaGTCCACGACCACCnACGTTGT 

ACVWQCLLSDSGQVLLESNI- 

B 

SH 

I PP 

1H 
21 



1261 1320 

ACTTCCAACACCCGTGTACCACGTCCGCCCACCTCCCCCTACCCCTCCCAnCACACGCC 

KVLPT WSTPVHADPE 



S 


SA 


BHF BS 


ANA 


HNCP 


SCWAANXA 


VLU 


PCRA 


PIUNWJLHV 


AA9 


AIFl 


1ADLH3A0A 


236 


2111 


21211A421 


// 


// 


/ / / / 
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MF 
AO 
EK 
11 



BS S S S 

SC F DHNA HNC A M M 

TR A RALU PCR F * B 

NF N AEA9 Air LEO 

11 1 2346 211 3 2 2 



14 ------ ^ 

lAGACTACCCTCCAKc/cCCACACCC^ 

1321 ;rcrc;rcrc;rcrArGrcrcrGrcrc*GccrcG*G«*cAccAncTccAccTCGAccTACA 

BS S US 

v D M SC « ANA M u T 

N D N TR B VLU B J ■ 

L E L NF 0 AA9 0 5 1 

1 U 2 246 2 11 

CnCCTCACCACCTGUCTCOGCCGCG^ UK 

1351 iurcr&wcc^^ 

A P E L L C C P S V F L F P P K P K . 

s ss N 

AN M HVANNAC DM M NS »J 

UL N PNVCLUR DS A LP * 

3A L ALAIA9F ET E \ 

A3 1 2121461 12 3 31 2 

AGCACACCCTCATGATnCCCGGACCCCTGAGGTCACATGCCKCTCGTGW ^ 
KOGTCGGACTACTAGAGCGCO^ 



D T L 



M i S R T P E V T C V V V D V S H 



u ov m. R M M 

N DS B SA N 

L ET 0 AE L 



,TCCATAATCCCA 

1S60 



5 12 2 12 1 f 

ACCAAGACCCTGACGTCAACnCAAClGCTACCTCCACCGCGTCCACGTCCA 

1501 

c D p E V K F N W Y V D C V E V H N A K - 



S 



M 
N 
L 
1 



F FN 

r. N rr R MR HNC HH 

U UPA S AS PCR CP 

A DBC A E A AIF AH 

H 222 1 21 211 11 



ACACAAAGCCGCGGGACGACCACTACAACACCA^ ^ 
15 " K^StftfCCKCTCCTCATC™^ 

T K P R E E 8 Y N S T Y R V V 5 V L T V- 
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BS 

M SC * 

N TR S 

I NF * 

1 11 1 

KC7CCACCAGGAC7GGC7CAA7CGCAAGCAG7ACAAG7CCAACC7C7CCAACAAACCCC 

. _ _ A _ » m _ w v » •> » * v * s • • • — — • ♦ 1 680 

AGGACGTCGTCCTGACCGACnACCCrrCCTCATGTTCACGTTCCACACGnCTTTCCGG 
LHQDWLNCKEYKCKVSNKAL- 

P S 5 

,,, T ADNNPV.A A 

N A VRLLUNU U 

. n AAAAML9 S 

j i 2244116 6 

//// / 

7CCCACCCCCCATCCACAAAACCA7CTCCAAACCCAAACGTGGGACCCGTCCGC7CCCAG 

1681 ♦ ♦ ♦ • 1 ?4 0 

AGGGTCCCGCG7ACCTCTTTTGGTAGAGCTTrCGCTT7CCACCC7CGCCACCCCACCCTC 

PAP1EKTISKAK 

S N 

H M N HHN BSAH D M M S R 

AN L APA GFUA D N A PS 

EL A EAE U9E EL E B A 

3 1 3 321 1163 113 2 1 

GCCCACA7GGACAGAGCCCCGCTCGGCCCACCCTCTCCCC7GACAGTCACCCCTGTACCA 

l7 4l * 1800 

CCGG7G7ACC7C7C7CCCGCCCAGCCCGC7GGGACACGCGAC7C7CAC7GGCGACA7CG7 

F SS 

M N A B R F I AHNNCC 

N u V B SO VPCCRK 

I 4 A V A K AAI1FF 

1 HI 1 11 "1111 

ACC7C7G7CCTACAGGGCAGCCCCCACAACCACAGG7G7ACACCC7CCCCCCA7CCCCGG 

1801 ^ # ^ --<♦ i860 

7GCAGACAGGA7G7CCCC7CCCGGC7Cn 

GQPREPQVYTLPPSRO- 
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BS BS B 

S A F SC SC S 

ML 0 TR TR P 

A U K NF NF M 

11 1 11 n 1 

ATCACCTCACCAACAACCACGlCAGCCTCACaCCCTGGTCAAACCCTICTATCCCACCG 

lfi61 1920 

TACTCCACTGG-nCTTCCTCCAGTCCGACTGGACGCACCAGTTTCCCAACATACGCTCCC 

ELTKNQVSLTCLVKCFYPSD- 

F 

N H B 
UP B 
4 A V 
H 2 1 

ACA1CGCCCTGCAGTGCCACACCAA7CCCCACCCCGACAACAACTACAACACCACCCCTC 

1521 TCTACCCCCACCKACCCTCTCCnACCCCT 

JAV E*ESNGQPENNYK77PP- 

B 

K' I V N H MA S 

N N B L P N L P 

L F 0 A H LU » 

112 4 1 11 1 

CCGTCCTCCACTCCCACCCCTCCTFCTrCCTCTACACCAACCTCACCCTCGACAAGAGCA 

196j CGCACGACCTGACGCTGCCCAGCA^ 



V L 



DSDCSFFLYSKLT VDKSR 



F S 

NV VEX NF V. N N 

UB ABM LA N S L 

40 EVN AN L I A 

H2 211 31 1 1 f 

GCTGCCAGCACGGCAACCTCTTCTCATCnCCCTGATCCATGAGGCKTGCACAACCACT 

J0M CCACCC7CCTCCCCT1CCACAACM 

WQQGNVF SCSVMH£ALHNHY- 

s 

M M. HNC CXH 

B N PCR FWA 

0 L AIF RAE 

2 1 211 133 

/ / 
ACACCCACAACACCCTCTCCCTCTC7CCCCCTAAATGACTCCCACGCCCG 

Q . 4 . . • 2150 

TGTGCCTCTTCTCCCACACCGACACACGCCCATTrACTCACCCTGCCCGC 

TQKSLSLSPGK- 
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Example 2: Preparation of the Fusion Proteins from Supernatants of COS Cells 

5 

COS cells grown in DME medium supplemented with 10% Calf Serum and gentamicin sulfate at 15 
ug/ml were split into DME medium containing 10% NuSerum {Collaborative Research) and gentamicin to 
give 50% confluence the day before transfection. The next day, CsCI purified plasmid DNA was added to a 
final concentration of 0.1 to 2.0 ug/ml followed by DEAE Dextran to 400 ug/ml and chloroquine to 100 uM. 

jo After 4 hours at 37 *C. the medium was aspirated and a 10% solution of dimethyl sulfoxide in phosphate 
buffered saline was added for 2 minutes, aspirated, and replaced with DME'10% Calf Serum. 8 to 24 hours 
later, the cells were trypsinized and split 1 :2. 

For radiolabeling, the medium was aspirated 40 to 48 hours after transfection, the cells washed once 
with phosphate buffered saline, and DME medium lacking cysteine or methionine was added. 30 minutes 

75 later, 35 S-labeled cysteine and methionine were added to final concentrations of 30-60 uci and 100-200 uci 
respectively, and the cells allowed to incorporate label for 8 to 24 more hours. The supernatants were 
recovered and examined by electrophoresis on 7.5% polyacrylamide gels following denaturation and 
reduction, or on 5% polyacrylamide following denaturation without reduction. The CD4B7I protein gave the 
same molecular mass with or without reduction, while the CD4E>1 and CD4H-yl fusion proteins showed 

20 molecular masses without reduction of twice the mass observed with reduction, indicating that they formed 
dimer structures. The CD4 IgM fusion proteins formed large multimers beyond the resolution of the gel 
system without reduction, and monomers of the expected molecular mass with reduction. 

Unlabeled proteins were prepared by allowing the cells to grow for 5 to 10 days post transfection in 
DME medium containing 5% NuSerum and gentamicin as above. The supernatants were harvested, 

25 centrifuged. and purified by batch adsorption to either protein A trisacryl. protein A agarose, goat anti- 
human IgG antibody agarose, rabbit anti-human IgM antibody agarose, or monoclonal anti-CD4 antibody 
agarose. Antibody agarose conjugates were prepared by coupling purified antibodies to cyanogen bromide 
activated agarose according to the manufacturer's recommendations, and using an antibody concentration 
of 1 mg/ml. Following batch adsorption by shaking overnight on a rotary table, the beads were harvested by 

30 pouring into a sintered glass funnel and washed a few times on the funnel with phosphate buffered saline 
containing 1% Nonidet P40 detergent. The beads were removed from the funnel and poured into a small 
disposable plastic column (Quik-Sep QS-Q column, Isolab), washed with at least 20 column volumes of 
phosphate buffered saline containing 1% Nonidet P40, with 5 volumes of 0.15 N NaCI. 1 mM EDTA (pH 
8.0). and eluted by the addition of either 0.1 M acetic acid. 0.1 M acetic acid containing 0.1 M NaCI, or 0.25 

35 M glycine-HCI buffer. pH 2.5. 

Example -3: Blockage of Syncytium Formation by the Fusion Proteins f 

40 Purified or partially purified fusion proteins were added to HPB-ALL cells infected 12 hours previously 
with a vaccinia virus recombinant encoding HIV envelope protein. After incubation for 6-8 more hours, the 
cells were washed with phosphate buffered saline, fixed with formaldehyde, and photographed. All of the 
full-length CD4 immunoglobulin fusion proteins showed inhibition of syncytium formation at a concentration 
of 20 ug/ml with the exception of the 4H 7 1 protein, which was tested only at 5 ug/ml and showed partial 

45 inhibition of syncytium formation under the same conditions. 

Example 4: Chromium Release Cytolysis Assay 



50 The purified fusion proteins were examined for ability to fix complement in a chromium release assay 
using vaccinia virus infected cells as a model system. Namalwa (B cell) or HPB-ALL (T cell) lines were 
infected with vaccinia virus encoding HIV envelope protein, and t8 hours later were radiolabeled by 
incUbation in 1 mci/ml sodium Sl chromate in phosphate buffered saline for 1 hour at 37 . The labeled cells 
were centrifuged to remove the unincorporated chromate. and incubated in microtiter wells with serial 

55 dilutions of the CD4 immunoglobulin fusion proteins and rabbit complement at a final concentration of 40%. 
After 1 hour at 37* . the cells were mixed well, centrifuged. and the supernatants counted in a gamma-ray 
counter. No specific release could be convincingly documented. 
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Example 5: Binding of the CD4E-rl Protein to Fc Receptors 

Purified CD4E 7 1 fusion protein was tested for its ability to displace radiolabeled human IgGi from 
human Fc receptors expressed on COS cells in culture. The IgGi was radiolabeled with soo.um iodide 

i us.ng 1 mci of iodide. 100 ug of IgGi. and two idobeads (Pierce). The labeled prote.n was separated from 
unincorporated counts by passage over a .Sephadex G25 column equilibrated with phospnate buttered 
saline containing 0.5 mM EDTA and 5% nonfat milk. Serial dilutions of the C04E 7 1 fus.on protein or 
unlabeled IgGi were prepared and mixed with a constant amount of radiolabeled IgGi tracer. After 
incubation with COS cells bearing the FcRI and RcRII receptors at 4 C for at least 45 minutes m a volume 

0 of 20 ul 200 ul of a 3:2 mixture of dibutyl to dioctyl phthalates were added, and the cells separated from 
ihe unbound label by centrifugation in a microcentrifuge for 1 5 to 30 seconds. The tubes were cut with 
scissors and the cell pellets counted in a gamma-ray counter. The affinity of the CD4E,i prote.n for 
receptors was measured in parallel with the affinity of the authentic IgGi protein, and was found to be the 
same, within experimental error. 

'5 

Example 6: Stable Expression of the Fusion Construct pCD4E 7 l in Baby Hamster Kidney Cells 

Twenty-four hours before transaction. 0.5 x 10 6 baby hamster kidney cells (BHK; ATCC CCL10) were 
?o seeded in a 25 cm 2 culture flask in Dulbecco's modified Eagle's medium (DMEN) containing 10% of fetal 
calf serum (FCS) The cells were cotransfected with a mixture of the plasmids pCD4E 7 i (20 ug). pSV2dhfr 
(5 ug- Lee et al.. Nature 294:228-232 (1981)) and P RMH140 (5 ug. Hudziak et aL, Cell 3V.137-146 (1982)) 
according tb"a^o5if5eTcateium phosphate transfection technique as described in Zertlmeissl et aL (Behnng 
Inst Res Comm 82:26-34 (1988)). 72 h post-transfection. cells were split 1:3 to 1:4 (60 mm culture oishes) 
?5 ir^rj r"e^itirFrio7of7es were selected in DMEM medium conta.ning 10% FCS. 400 ug/ml G418 (Geneticin, 
Gibco) and 1 uM methotrexate (selection medium). The medium was changed twice a week. The resistant 
colonies (40-100/transfection) appeared 10-15 day post-transfection and were further propagated either as a 
mixture of clones (i.e.. BHK-NK1) or as individually isolated clones. For the determination of the relative 
expression levels, clone mixtures or individual clones were grown to confluency in T25 culture flasks. 
30 washed twice with protein-free DMEM medium, and incubated for 24 h with 5 ml prote.n-free DMEM 
medium These media were collected and subjected to a human IgG specific ELISA in order to determine 
the relative expression levels of the CD4-lgGl fusion protein CD4E 7 1. For further analysis an individual 
clone (BHK-UC3) was chosen due to its high relative expression levels. 

3S 

Example 7: Detection of the CD4E-yi Protein in Culture Supernatants 

For 35 S methionine labeling of cells, the clone BHK-UC3 and uctransfected BHK cells (control) were 
grown to confluency in T25 culture flasks and subsequently incubafed for two hours in HamFl2 medium 

40 withoul methionine. Labeling was achieved by incubating 24 h in 2.5 ml of the same medium containing 100 
uCi 3 -S methionine (1070 Ci/mmole. Amersham). For the preparation of cell lysates. the labeled cells were 
harvested in 1 ml of phosphate buttered saline. pH 7.2 (PBS) and lysed by repetitive freezing and thawing. 
Cleared lysates (after centrifugation 20000 rpm. 20 min) and culture supernatants were incubated with 
Protein A-Sepharose (Pharmacia) and the bound material was analyzed on a 10% SDS-Protein A-Sepharose 

-5 (Pharmacia) and the bound material was analyzed on a 10% SDS-gel according to Laemmli (Nature 
227 680-685 (1970)) which was subsequently autoradiographed. A specific band of about 80 KDa can be 
delected only in the supernatant of clone BHK-UC3. which is absent in the lysate of clone BHK-UC3 and in 
the respective controls. 

50 

Example 8: Purification of the Protein CD4E 7 1 from Culture Supernatants 

in order to demonstrate that the fusion protein coded by the plasmid pCD4E>l can be obtained in high 
quantities the clone BHK-UC3 was grown in 1750 cm 2 roller bottles in selection medium (500 ml). Confluent 
55 monolayers were washed twice with protein-free DMEM medium (200 ml) and further incubated for 48 h 
with protein-free DMEM medium (500 ml). The conditioned culture supernatants (1-2 I) and respective 
supernatants from untransfected BHK cells were cleared by centrifugation (9000 rpm. 30 mm) and 
microfiltered through a 0.45 um membrane (Nalgene). After addition of 1% (v/v) of 1.9 M Tr.s-HCI butter. 
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pH 8.6. the conditioned medium was absorbed to a Protein A-Sepharose column equilibrated with 50 mM 
Tris-HCl pH 8.6 buffer containing 150 mN NaCI (4 - C). The loaded column was washed with 10 column 
volumes of equilibration buffer. Elution of the CD4-lgGl fusion protein CD4E 7 1 was achieved with 0.1 M 
sodium citrate buffer. pH 3, followed by immediate neutralization of the column efflux to pH 8 by Tris-base. 

5 The peak fractions were pooled, and the pool was analyzed on a Coomassie blue stained SDS-gel resulting 
in a band of the expected size (80 KDa). and which reacted with a polyclonal anti-human IgG heavy chain 
antibody and a mouse monoclonal anti-CD4 anti body (BMA040, Behringwerke) in Western Blots. The 
yields of purified fusion proteins obtained by the given procedure is 5-18 mg/24 h/l culture supernatant. The 
respective value for a BHK clone mixture (about 80 resistant clones; BHK-NK1) as described above was 2-3 

io mg/24 h/l. 



Example 9: Physical and Biological Characterization of the CD4E 7 1 Fusion Protein 



is As proven by SDS-electrophoresis on 10-15% gradient gels (Phast-System. Pharmacia) under non- 
reductive conditions, the CD4E 7 1 fusion protein migrates at the position of a homodimer (about 160 KDa) 
like a non-reduced mouse monoclonal antibody. This result is supported by analytical equilibrium ultracen- 
trifugation, where the fusion protein behaves as a homogeneous dimeric molecule of about 150 KDa. The 
absorbance coefficient of the protein was determined as A 28 o s 18 cm 2 /mg using the quantitative protein 

so determination according to Bradford ( Anal. Biochem. 72:248-254 (1976)). 

The CD4E7l-fusion protein shows specific complex formation with a solubilized £gal-gpl20 fusion 
protein (pMB!790: Broker et al., Behring Inst. Res. Commun. 82:338-348 (1988)) expressed in E. coli. In this 
protein (110 KDa). a majorpart of the HIV gp120 protein (VaUs-Trp^e) is fused to /9-ga!actosidase (amino 
acids 1-375). In a control experiment a 67-KDa 0-gal-HIV 3orf fusion protein (0gah-375; 3 orf Prol4- 

25 Asp123) showed no complex formation. En these experiments, the CD4E7I -protein was incubated with the 
respective fusion protein in molar rations of about 5:1. The complex was isolated by binding to Protein A- 
Sepharose and the Protein A-Sepharose bound proteins-together with relevant controls-were analyzed on 
10-15% gradient SDS-gels (Phast-System, Pharmacia). 

The CD4E 7 1 fusion protein binds to the surface of HIV (HIV1/HTLV-IIIB) infected cultured T4- 

30 lymphocytes as determined by direct immunofluorescence with fluorescein-isothiocyanate (FITC) labeled 
CD4E 7 1 protein. It blocks syncytia formation in cultured T4-lymphocytes upon HIV infection (0.25 TCID/cell) 
at a concentration of 10 ug/ml. Furthermore. HIV-infected cultured T4-lymphocytes (subclone of cell line 
H9) are selectively killed upon incubation with CD4E 7 l in the presence or absence of complement: To a 
highly (>50%) HIV infected culture of T4-lymphocytes (10 6 celts/ml) 50, 10 or 1 ug/ml CD4E 7 1 fusion 

35 protein was added in the presence or absence of guinea pig complement. Cells were observed for specific 
killing by the fusion protein, which is defined by the percentage of killed cells after 3 days in relation to 
viable cells in the culture at the beginning of the experiment corrected by the values for unspecific killing 
observed in control cultures. lacking the CD4E 7 1 fusion protein (Table |. Experiment 1). Surprisingly, 
addition of CD4E 7 1 protein to the infected T4 cells in the absence of complement resulted in similar 

40 specific killing rates as in the presence of complement (Table 5, Experiment II). This result demonstrates a 
complement independent cytolytic effect of CD4Eyl on HIV infected T-lymphocytes in culture. 

Table 5 



45 


No. 


Assay System 


Specific 




Experiment 




Killing (%) 




I 


non-infected T4-cells + 50 ug/ml CD4E 7 I + Compl. 


0.7 






infected T4-cetls + 50 ug/ml CD4E 7 I + Compl. 


35.1 


50 




infected T4-cells + 10 ug/ml CD4E 7 I ♦ Compl. 


25.1 






infected T4-cells ♦ 1 ug/ml CD4E 7 l + Compl. 


25 




II 


infected T4-cells + 10 ug/ml CD4E 7 I + Compl. 


49.9 






infected T4-cells + 10 ug/ml CD4E 7 I ♦ Compl. 


69.4 



Having now fully described this invention, it will be appreciated by those skilled in the art that the same 
can be performed with any wide range of equivalent parameters of composition, conditions, and methods of 
preparing such fusion proteins without departing from the spirit or scope of the invention or any 
embodiment thereof. 
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Claims 

1. A fusion protein gene comprising 1) the DNA sequence of CD4, or fragment thereof which binds to 
HIV gpi20. and 2) the DNA sequence of an immunoglobulin heavy chain, charactenzea in that the DNA 

5 sequence which encodes the variable region of said immunoglobulin chain has been replaced with the DNA 
sequence which encodes CD4, or said gpi20 binding fragment thereof. 

2. The fusion protein gene of claim 1. wherein the DNA sequence which encodes said fragment of CD4 
comprises the following DNA sequence: 



10 



/5 



20 



25 



30 



35 



40 



CAATCAACCGCG 

120 

GTTACT7GGCCC 



GAC7CCC7777ACCCAC77GC77C7GC7CC7GCAAC7CGCGC7CC7CCCAGCACCCAC7C 
C7CACGGAAAA7CCG7CAACGAACACCACGACC77CACCCCCACCAGGG7CG7CGC7CAC 
AGCGAAAGAAAG7GG7CC7GGGCAAAAAAGCGGA7ACAG7CGAAC7GACC7G7ACAGC77 

. ♦ 4- + ^ 

TCCC777C777CACCACCACCCCT7T777CCCC7A7C7CACCT7CAC7GGACA7C7CCAA 



CCCACAAGAACAGCA7ACAA77CCAC7GCAAAAAC7CCAACCACATAAAGA77C7CGGAA 

241 * * ♦ * * 300 

CCC7C77C7TC7CC7A7C7TAACC7GACCTT7T7GACC7TCG7C7A777CTAACACCC7T 

A7CACGCC7CC77C77AAC7AAACC7CCA7CCAACC7GAA7CA7CCCGC7CAC7CAACAA 

301 - * ♦ * ♦ 360 

7AC7CCCCAGGAACAA77GA7T7CCAGC7ACCT7CCAC77AC7ACCGCCAC7CAC77C77 

GAACCC777GCGACCAAGGAAAC77CCCCC7CA7CA7CAACAA7C77AAGA7ACAAGAC7 

361 * ♦ ♦ * 420 

C77CCCAAACCC7GC77CC777CAACCGGGAC7AC7AC7TC7TACAA77C7A7C77C7CA 



CAGATACT7ACA7CTC7CAAC7CGAGCACCAGAAGCACCACCTCCAAT7CCTACTG77CC 

421 ♦ * ♦ 4 * 480 

'5 C7C7A7CAA7G7ACACAC77CACC7CC7CC7C77CC7CC7CCACCT7AACGA7CACAAGC 

CA77CACTCCCAAC7CTCACACCCACCTCCTTC 

481 ♦ * 

50 C7AAC7GACCC77GAGAC7C7CGG7CCACGAAC 

or a degenerate variant thereof, or the following DNA sequence: 

55 
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C A ATC AACCCCG 

120 

5 CTTACTTGGCCC 

CAGTCCCTTTTAGCCACTTGCTTCTGGTGCTGCAACTGGCGCTCCTCCCAGCACCCACTC 

121 ♦ ♦ ♦ ♦ ♦ 180 

TO C7CACGGAAAA7CCC7CAACCAACACCACCACG77GACCCCCAGCAGGC7CC7CGC7CAC 

AGGGAAAGAAACTGG7GCTCGGCAAAAAAGGGGATACACTGGAACTCACCTGTACAGCTT 

181 + + * * * 240 

75 7CCC777C777CACCACGACCCC777777CCCC7A7G7CACCT7GAC7GGACA7C7CGAA 

CCCACAACAACAGCA7ACAA77CCAC7CGAAAAAC7CCAACCAGA7AAAGA77C7CGCAA 

241 ♦ + ♦ ♦ ♦ 300 

20 GCC7C77C7TC7CC7A7C77AAGG7CACC77TTTCAGCTTGC7C7A777C7AACACCCT7 

ATCAGCGCTCCTTCTTAACTAAACC7CCATCCAAGCTGAATCATCGCGCTCACTCAACAA 

301 ♦ * * * ♦ 360 

25 TAGTCCCCAGGAAGAATTGATTTCCAGG7AGGTTCCACTTACTACCGCGACTGAGTTCTT 

GAAGCC777GGGACCAAGGAAAC77CCCCC7GA7CATCAAGAA7C77AAGA7AGAACAC7 

361 ♦ ♦ * ♦ ♦ 420 

3Q C77CCCAAACCC7GG77CCT77GAAGGGGGAC7AG7AGTTCT7AGAATTC7A7CT7C7GA 

CAGA7AC77ACA7C7C7CAAC7GCACGACCAGAACCAGGAGG7GCAAT7CC7AG7C77CG 

421 ♦ * * * ♦ 480 

35 C7C7A7CAA7G7ACACAC77CACC7CC7GG7C77CC7CC7CCACG77AACGA7CACAACC 



CA77CAC7GCCAAC7C7GACACCCACC7GC77CAGGGCCAGAGCC7CACCC7GACC77GG 

40 481 * ♦ ♦ ♦ * 540 

C7AAC7CACGG77CACAC7C7CCC7CCACGAAG7CCCCG7C7CCCAC7CGGAC7CGAACC 

ACACCCCCCC7CG7AC7AGCCCC7CAG7GCAA7C7AGGAC7CCAAGGGG7AAAAACA7AC 

541 ♦ ♦ * ♦ ♦ 600 

7C7CGCCCGGACCA7CA7CCCCCAG7CACC77ACA7CC7CAGG77CCCCA77T77G7A7G 

AGGCCGGCAAGACCC7C7CCC7C7C7CAG 

50 6 01 ♦ ♦ 

7CCCCCCC77C7CCGACACGCACAGAG7C 

or a degenerate variant thereof. 
55 3. The fusion protein gene of claim 1 or 2. characterized in that said immunoglobulin chain is of the 



class igM, igGl or lgG3. 
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4. A fusion protein gene comprising 1) the DNA sequence of CD4, or fragment thereof which binds to 
HIV gpl20. and 2) the DNA sequence of an immunoglobulin light chain, characterized in that the DNA 
sequence which encodes the variable region of said .mmunoglobulin light chain has been replaced with the 
DNA sequence which encodes CD4. or HIV gpl20-binding fragment thereof. 
5 5. A fusion protein gene of claim 4. characterized in that the DNA sequence which encodes said 
fragment of CD4 comprises the following DNA sequence: 

CAATGAACCGGG 

120 

C77AC77GCCCC 



70 



CAC7CCC77T7ACGCAC77GC77C7CC7CC7CCAAC7GGCCC7CC7CCCACCACCCAC7C 

121 * * * 

75 C7CACGGAAAA7CCC7CAACCAACACCACGACC77GACCCCGACGAGGG7CG7CGG7GAC 



20 



2S 



30 



35 



ACCCAAACAAAC7GC7GC7GGGCAAAAAACCGCA7ACAC7GGAAC7CACC7C7ACACC77 
7CCC777C777CACCACGACCCG777777CCCC7A7C7CACC77GAC7CCACA7C7CGAA 
CCCACAACAACACCA7ACAA77CCAC7CGAAAAAC7CCAACCACA7AAACA7TC7GGGAA 
GCC7C77C77C7CC7A7G77AACG7GACC77777CAGC77CG7C7A777C7AAGACCC77 
A7CAGCGC7CC77C77AAC7AAAGC7CCA7CCAAGC7GAA7GA7CGCGC7CAC7CAAGAA 
7AG7CCCCAC0AACAA77CA777CCAGC7AGC77CCAC77AC7AGCCCGAC7GAC77C77 
CAAGCC777GGCACCAACCAAAC77CCCCC7CA7CA7CAACAA7C77AACA7ACAACAC7 
CT7CGGAAACCC7GG77CC777CAACGCCCAC7AG7AGT7C^rAGAA77C7A7C77C7GA 



40 CAGA7AC77ACA7C7G7GAAG7GCAGCACCAGAAGCACGACC7GCAA77CC7AG7C77CC 

421 * * * 

C7C7A7GAA7G7AGACAC77CACC7CC7GG7C77CC7CC7CCACG77AACGA7CACAACC 



45 CA77CAC7CCCAAC7C7CACA0CCACCTCC77C 

481 ♦ * 

C7AAC7GACGG77CACAC7C7GCC7CCACCAAG 



50 



or a degenerate variant thereof, or the following DNA sequence: 
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CAATCAACCGCG 

120 

CTTACTTCGCCC 



CAG7CCC7777ACCCAC77CC77C7CG7CC7CCAAC7CGCCC7CC7CCCAGCACCCAC7C 

121 ♦ ♦ ♦ ♦ ♦ 180 

C7CACGGAAAA7CCC7CAACGAACACCACCACC77CACCGCCAGCAGCG7CG7CCC7CAC 

ACGCAAAGAAAG7CC7CC7GCCCAAAAAAGGCGA7ACAC7CGAAC7GACC7C7ACACC77 

181 ♦ ♦ ♦ * ♦ 240 

TCCCTTTCTTTCACCACGACCCGTTTT7TCCCCTATGTCACCTTGAC7GGACATGTCCAA 

CCCAGAACAACACCA7ACAA77CCAC7CGAAAAAC7CCAACCACA7AAAGA77C7GCCAA 

241 ♦ ♦ ♦ * 300 

GGCTCTTCTTCTCCTATCTTAACGTGACCTTTTTCAGGTTGGTCTATTTCTAAGACCCTT 

A7CACCCC7CC7TC77AAC7AAACC7CCA7CCAACC7CAA7CA7CCCGC7CAC7CAACAA 

301 ♦ + ♦ + 360 

7AG7CCCCACCAACAA77CA777CCAGG7AGG77CGAC77AC7AGCGCCAC7CAC77C77 

GAAGCCTTTGGGACCAACGAAACTTCCCCCTGATCATCAACAATCTTAAGATAGAAGACT 

361 ♦ ♦ ♦ * 420 

C77CGGAAACCC7GC77CC777CAACCGGCAC7AC7AG77C77AGAA77C7A7C77C7CA 

CACATACTTACATCTGTGAACTCGAGGACCACAAGGAGGAGGTGCAATTGCTAGTGTTCG 

421 ♦ ♦ ♦ * * 480 

GTCTA7CAATG7AGACACTTCACC7CCTGCTCTTCCTCCTCCACCTTAACGATCACAAGC 

GAT7GAC7GCCAAC7C7GACACCCACC7GC77CACGGGCACAGCC7cfACCC7CACC77GG 

481 ♦ * ♦ * - 540 

C7AAC7CACGG77GAGAC7G7GGG7GCACGAAC7CCCCC7C7CCGAC7GGCAC7CCAACC 

AGACCCCCCC7GC7AG7AGCCCC7CAC7CCAA7G7ACCAC7CCAACCGG7AAAAACA7AC 
541 ♦ ♦ ♦ ♦ ♦ 600 



7C7CCGCGGCACCA7CA7CGCCCAG7CACC77ACA7CC7CACG77CCCCA77777G7A7C 

AGGGGGCCAAGACCC7C7CCG7C7C7CAG 

601 ♦ ♦ 

7CCCCCCC77C7CGGAGAGGCACAGAC7C 

a degenerate variant thereof. 
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30 



35 
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6 A vector comprising the fusion protein gene of claim i. preferably having the identifying character.s- 
tics of P CD4H 7 1 which has been deposited unoer Access.on No. 67611. or pCD4Mu. which has been 
deposited under Accession No. 67608. or of pCD4Pu. which has been deposited under Accession No. 
67609. or of pCD4E 7 l. which has been deposited under Accession No. 67610, all in E. coU at the ATCC 
under the terms of the Budapest Treaty. 

7. A vector comprising the fusion protein gene of claim 4. 

8 A host transformed with the vector of claim 6 or 7. 

9 The host of claim 8 which expresses an immunoglobulin light chain together with the expression 
product of said fusion protein gene to give an immunoglobulin-like molecule which binds to gpl20 or an 
immunoglobulin heavy chain together with the expression product of said fusion protein gene to give an 
immunoglobulin-like molecule which binds to HIV or SIV gpl20. 

10. The host of claim 9. wherein said immunoglobulin heavy chain is of the immunoglobulin class IgM. 

IgGl or lgG3. 

11. A method of producing a fusion protein comprising CD4, or fragment thereof which binds to gpl20. 
and immunoglobulin heavy chain, wherein the variable region of the immunoglobulin chain has been 
substituted with CD4, or fragment thereof which binds to HIV or SIV gpi20. characterized by cultivating in a 
nutrient medium under protein-producing conditions, a host strain transformed with the vector of claim 6. 
said vector further comprising expression signals which are recognized by said host strain and direct 
expression of said fusion protein, and recovering the fusion protein so produced. 

12. The method of claim 11. wherein said host strain is a myeloma cell line which produces 
immunoglobulin light chains and said fusion protein comprises an immunoglobulin heavy chain of the class 
IgM. IgGl or lgG3. wherein an immunoglobulin-like molecule comprising said fusion protein is produced. 

13. A method of producing a fusion protein comprising CD4. or fragment thereof which binds to gpi20. 
and an immunoglobulin light chain, wherein the variable region of the immunoglobulin chain has been 
substituted with CD4. or fragment thereof which binds to HIV or SIV gpl20. characterized by cultivating in a 
nutrient medium under protein-producing conditions, a host strain transformed with the vector of claim 7, 
said vector further comprising expression signals which are recognized by said host strain and direct 
expression of said fusion protein, and recovering the fusion protein so produced. 

14. The method of claim 13. wherein said host produces immunoglobulin heavy chains of the class 
IgM. IgGl and lgG3 together with said fusion protein to give an immunoglobulin-like molecule which binds 
to HIV-gpl20. 

15. A fusion protein, which is preferably detectably labeled, comprising CD4. or fragment thereof which 
is capable of binding to HIV or SIV gpl20. fused at the C-terminus to a second protein which comprises an 
immunoglobulin heavy chain of the class IgM. IgGl or lgG3. wherein the variable region of said heavy chain 
immunoglobulin has been replaced with CD4. or HIV gpl20-binding fragment thereof, and preferably further 
comprising a therapeutic agent, radiolabel or NMR imaging agent linked to said fusion protein. 

16. The fusion proteins CD4H 7 1, CD4Mu. CD4Pu, CD4E 7 1 or CD4B-yl . 

17. An immunoglobulin-like molecule, comprising the fusion protein of claim 15 and an immunoglobulin 
light chain, preferably further comprising a detectable label, and especially further comprising a therapeutic 
agent, radiolabel or NMR imaging agent linked to said immunoglobulin-like molecule. 

18. A fusion protein comprising CD4. or fragment thereof which binds to HIV gpl20. fused at the C- 
terminus to a second protein comprising an immunoglobulin light chain where the variable region has been 
deleted, and which fusion protein preferably is detectably labeled, especially further comprising a therapeu- 
tic agent, radiolabel or NMR imaging agent linked to said fusion protein. 

19. The fusion protein of claim 15. wherein said CD4 fragment comprises the following amino acid 

sequence: 
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is or the following amino acid sequence: 
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20. An immunoglobulin-like molecule comprising the fusion protein of claim 18 and an immunoglobulin 
heavy chain of the class IgM, IgGi or lgG3. preferably further comprising a detectable label, and especially 
further comprising a therapeutic agent, radiolabel or NMR imaging agent linked to said immunoglobulin-like 
molecule. 

21. A complex comprising the fusion protein of claim 15 or 18 and HIV or SIV gpi20. 

22. The complex of claim 21. wherein said gpl20 is a part of an Hf/ or SIV. is expressed on the 
surface of an HIV or SIV-infected cell or is present in solution. 

23. A method for the detection of HIV or SIV gpi20 in a sample, characterized by 

(a) contacting a sample suspected of containing HIV or SIV gpl20 with the fusion protein of claim 15 
or 18. and 

(b) detecting whether a complex is formed, said fusion protein preferably being detectably labeled. 



Claims for the following Contracting State: GR 

1. A vector comprising a fusion protein gene comprising 1) the DNA sequence of CD4, or fragment 
50 thereof which binds to HIV gpi20. and 2) the DNA sequence of an immunoglobulin heavy chain, 

characterized in that the DNA sequence which encodes the variable region of said immunoglobulin chain 
has been replaced with the DNA sequence which encodes CD4. or said gpi20 binding fragment thereof. 

2. The vector of claim 1. having the identifying characteristics of pCD4H 7 1. which has been deposited 
in E. coli at the ATCC under the terms of the Budapest Treaty under Accession No. 67611. 

55 3. The vector of claim 1. having the identifying characteristics of pCD4Mu. which has been deposited in 
E. coli at the ATCC under the terms of this Budapest Treaty under Accession No. 67608. 

4. The vector of claim 1, having the identifying characteristics of PCD4Pu, which has been deposited in 
E:. coli at the ATCC under the Budapest Treaty under Accession No. 67609. 
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5. The vector ol claim 1, having the identifying characteristics of PC4E>1. which has been deposited in 
E. coli at the ATCC unoer the terms of the Budapest Treaty under Accession No. 67610. 

6. A vector comprising a fusion protein gene characterized by 1) the DNA sequence of CD4. or 
fragment thereof which binds to HIV gpl20, and 2) the DNA sequence of an immunoglobulin light chain. 

5 wherein the DNA sequence which encodes the variable region of said immunoglobulin light chain has been 
replaced with the DNA sequence which encodes CD4. or HIV gpl20-binding fragment thereof. 

7. A host transformed with the vector of claim i. 

8. The host of claim 7 which expresses an immunoglobulin light chain together with the expression 
product of said fusion protein gene to give an immunogiobuiin-like molecule which binds to gpi20. 

w 9. A host transformed with the vector of claim 6. 

10. The host of claim 6 which expresses an immunoglobulin heavy chain together with the expression 
product of said fusion protein gene to give an immunoglobulin-like molecule which binds to HIV or SIV 
9P120. 

11. The host of claim 10. characterized in that said immunoglobulin heavy chain is of the im- 
75 munoglobulin class IgM, IgGi or lgG3. 

12. A method of producing a fusion protein comprising CD4. or fragment thereof which binds to gpl20, 
and an immunoglobulin heavy chain, wherein the variable region of the immunoglobulin chain has been 
substituted with CD4, or fragment thereof which binds to HIV or SIV gpl20. characterized by cultivating in a 
nutrient medium under protein-producing conditions, a host strain transformed with the vector of claim 1, 

20 said vector further comprising expression signals which are recognized by said host strain and direct 
expression of said fusion protein, and recovering the fusion protein so produced. 

13. The method of claim 12. characterized in that said host strain is a myeloma cell line which produces 
immunoglobulin light chains and said fusion protein comprises an immunoglobulin heavy chain of the class 
IgM. IgGi or lgG3. wherein an immunoglobulin-like molecule comprising said fusion protein is produced. 

25 14. A method of producing a fusion protein comprising CD4, or fragment thereof which binds to gpl20. 
and an immunoglobulin light chain, wherein the variable region of the immunoglobulin chain has been 
substituted with CD4. or fragment thereof which binds to HIV or SIV gpl20. characterized by cultivating in a 
nutrient medium under protein-producing conditions, a host strain transformed with the vector of claim 6. 
said vector further comprising expression signals which are recognized by said host strain and direct 

30 expression of said fusion protein, and recovering the fusion protein so produced. 

15. The method of claim 14. characterized in that said host produces immunoglobulin heavy chains of 
the class IgM, IgGi and lgG3 together with said fusion protein to give an immunoglobulin-like molecule 
which binds to HIV-gpl20. 

16. A method for the detection of HIV or SIV gpi20 in a sample, characterized by 

35 (a) contacting a sample suspected of containing HIV or SIV gpi20 with a fusion protein comprising 

CD4. or fragment thereof which binds to HIV gpl20. and 2) an immunoglobulin heavy chain, wherein the 
variable region of said immunoglobulin chain has been replaced with CD4. or said gp120 binding fragment 
thereof, and 

(b) detecting whether a complex is formed. f 

40 

17. The method of claim 16. characterized in that said fusion protein is detectably labeled. 

18. A method for the detection of HIV or SIV gpi20 in a sample, characterized by 

(a) contacting a sample suspected of containing HIV or SIV gpl20 with a fusion protein comprising 
comprising 1) CD4. or fragment thereof which binds to HIV gp120. and 2) an immunoglobulin light chain, 

45 wherein the variable region of said immunoglobulin light chain has been replaced with CD4. or HIV gpl20- 
binding fragment thereof, and 

(b) detecting whether a complex has formed. 

19. The method of claim 18. characterized in that said fusion protein is detectably labeled. 

50 

Claims for the following Contracting State: ES 

1. A method of producing a fusion protein comprising CD4, or fragment thereof which binds to gpl20. 
55 and an immunoglobulin heavy chain, wherein the variable region of the immunoglobulin chain has been 
substituted with CD4, or fragment thereof which binds to HIV or SIV gpl20. characterized by cultivating in a 
nutrient medium under protein-producing conditions, a host strain transformed with a vector comprising a 
fusion protein gene comprising 1) the ONA sequence of C04. or fragment thereof which binds to HIV 
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gpl20. and 2) the ONA sequence of an immunoglobulin heavy chain, wherein the DNA sequence which 
encodes the variable region of said immunoglobulin chain has been replaced with the DNA sequence which 
encodes CD4. or said gpl20 binding fragment thereof, said vector further comprising expression signals 
which are recognized by said host strain and direct expression of said fusion protein, and recovering the 
5 fusion protein so produced. 

2. The method of claim 1. characterized in that said vector has the identifying characteristics of 
pCD4H 7 l. which has been deposited in E. coli at the ATCC under the terms of the Budapest Treaty under 
Accession No. 67611. 

3. The method of claim 1. characterized in that said vector has the identifying characteristics of 
jo PCD4Mu. which has been deposited in E. coli at the ATCC under the terms of this Budapest Treaty under 

Accession No. 67608. 

4. The method of claim 1, characterized in that said vector has the identifying characteristics of 
PCD4Pu, which has been deposited in E. coli at the ATCC under the Budapest Treaty under Accession No. 
67609. 

js 5. The method of claim 1, characterized in that said vector has the identifying characteristics of 
pCD4E 7 l. which has been deposited in E. coli at the ATCC under the terms of the Budapest Treaty under 
Accession No. 67610. 

6. The method of claim 1, characterized in that said host strain is a myeloma cell line which produces 
immunoglobulin light chains and said fusion protein comprises an immunoglobulin heavy chain of the class 

20 IgM, IgGl or lgG3. wherein an immunoglobulin-like molecule comprising said fusion protein is produced. 

7. A method of producing a fusion protein comprising CD4, or fragment thereof which binds to gpl20. 
and an immunoglobulin light chain, wherein the variable region of the immunoglobulin chain has been 
substituted with CD4, or fragment thereof which binds to HIV or SIV gpl20. characterized by cultivating in a 
nutrient medium under protein-producing conditions, a host strain transformed with a vector comprising a 

25 fusion protein gene comprising 1) the DNA sequence of CD4, or fragment thereof which binds to HIV 
gpl20. and 2) the DNA sequence of an immunoglobulin light chain, wherein the DNA sequence which 
encodes the variable region of said immunoglobulin light chain has been replaced with the DNA sequence 
which encodes CD4, or HIV gpl20-binding fragment thereof, said vector further comprising expression 
signals which are recognized by said host strain and direct expression of said fusion protein, and recovering 

30 the fusion protein so produced. 

8. The method any one of claims 1 or 7. characterized in that the DNA sequence which encodes said 
fragment of CD4 comprises the following DNA sequence: 
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CAATCAACCCGG 

120 

5 GTTAC7TCCCCC 

GAGTCCCTTTTAGGCACTTCCTTCTCCTGCTGCAACTCCCCCTCCTCCCACCAGCCACTC 

121 ♦ * ♦ ♦ ♦ 180 

W C7CACGCAAAA7CCGTGAACCAAGACCACGACCTTGACCCCCACCACGCTCG7CGGTGAC 

AGCGAAAGAAAG7CG7CC7GGGCAAAAAAGGGGA7ACAG7GGAAC7CACC7G7ACAGC77 

181 * + ♦ * 240 

75 TCCCTTTC77TCACCACCACCCC7TTTTTCCCCTATCTCACCTTCACTCCACATCTCCAA 

CCCACAACAACAGCA7ACAA77CCAC7CCAAAAAC7CCAACCAGA7AAAGATTC7CCCAA 

241 ♦ * ♦ ♦ 300 

2Q CGG7C77C77C7CG7A7G7TAAGC7GACC7777TGAGG7TGGTC7ATT7C7AACACCCT7 

A7CACGGC7CC77C77AAC7AAAGG7CCA7CCAAGC7GAA7GA7CGCCC7GAC7CAAGAA 

301 ♦ ♦ * ♦ ♦ 360 

?5 7AG7CCCGACGAACAA77GA777CCAGG7ACG77CCAC77ACTAGCCCGAC7GAC77C77 

GAAGCC777CGGACCAACCAAAC77CCCCC7CA7CA7CAACAATC7TAAGA7ACAACAC7 

361 + ♦ ♦ + + 420 

3Q CTTCCCAAACCCTCGT7CCTTTCAAGCCGCACTACTAGTTCTTACAATTCTATCTTCTCA 

CACA7ACTTACATC7GTGAACTGGACCACCACAAGCAGCACCTCCAATTGCTAGTCTTCC 

421 * + ♦ ♦ 480 

35 G7C7A7CAA7G7ACACAC77CACC7CC7CG7C7TCC7CCTCCACG77AACCA7CACAAGC 

CA77GAC7GCCAAC7C7GACACCCACC7CCTTC 

481 ♦ | 

C7AAC7GACGG7TGAGAC7C7GCG7GGACGAAG 

40 

or a degenerate variant thereof. 

9. The method of any one of claims 1 or 7, characterized in that said DNA sequence which encodes 
45 said fragment of CD4 comprises the following DNA sequence: 

50 
55 
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CAATGAACCCGG 

120 

5 

. GTTACTTGGCCC 

GAC7CCCTTT7AGGCAC77GC77C7CC7GC7CCAAC7GCCCC7CC7CCCAGCAGCCAC7C 

121 ♦ ♦ * ♦ ♦ 180 

10 C7CAGGGAAAATCCG7GAACGAAGACCACCACCTTCACCGCCACGACGGTCCTCGGTGAC 

AGGGAAACAAAG7GG7GC7GGCCAAAAAAGGCGA7ACAG7GGAAC7CACC7G7ACACC77 

181 + ♦ ♦ ♦ 240 

75 TCCCTTTCTTTCACCACGACCCGTTTTTTCCCCTATCTCACCTTGACTGGACATG7CCAA 

CCCAGAAGAAGACCATACAATTCCACTGGAAAAACTCCAACCAGATAAAGATTCTCCGAA 

241 ♦ ♦ ♦ ♦ ♦ 300 

20 GGGTCTTCTTCTCGTATGTTAAGGTGACCTTTTTGAGCTTGGTCTATTTCTAAGACCCT7 

ATCAGGGCTCCTTCTTAACTAAAGGTCCATCCAACCTGAATGATCGCGCTCACTCAAGAA 

301 + ♦ ♦ + 360 

25 TAGTCCCCAGGAAGAATTCATTTCCAGGTAGGT7CGACTTACTAGCCCGACTGAGTTCTT 

GAAGCCTTTGGCACCAAGGAAACTTCCCCCTGATCATCAAGAATC7TAAGATACAAGACT 

361 * ♦ ♦ ♦ * 420 

30 CTTCGGAAACCCTGCTTCCTTTGAAGGGGGACTAGTACTTCTTAGAATTCTATCTTCTGA 

CAGA7AC77ACA7C7C7CAAG7CGAGGACCAGAAGGAGCACG7GCAA77GC7AG7C77CC 

421 ♦ ♦ ♦ ♦ ♦ 480 

35 G7C7A7GAA7C7AGACAC77CACC7CC7GG7C77CC7CC7CCACG77AACCA7CACAAGC 

CAT7CAC7GCCAAC7C7CACACCCACC7GC77CAGGGCCAGAGCC7GACCC7CACC77GC 

481 ♦ ♦ ♦ ♦ 540 

40 C7AAC7GACCC77CACAC7G7CCG7GCACCAAG7CCCCG7C7CGGAC7GGGAC7GGAACC 

ACACCCCCCC7CC7AC7ACCCCC7CAC7CCAA7C7ACCAC7CCAAGCGC7AAAAACATAC 

541 ♦ ♦ ♦ * ♦ ' 600 ^ 

45 7C7CGCGGGCACCA7CA7CGGGCAG7CACGT7ACA7CC7CAGG77CCCCA77777C7A7G * : • . 

AGGGGGCCAAGACCC7C7CCG7C7C7CAC 

601 * * 

50 7CCCCCCC77C7CGCACAGGCACAGAC7C 



or a degenerate variant thereof. 

10. The method of claim 7, characterized in that said host produces immuno-globulin heavy chains of 
the class IgM. igGl and lgG3 together with said fusion protein to give an immunoglobulin-like molecule 
which binds to HIV-gpl20. 
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